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PRKFA(’E 


The title of this volume is an abl)re,viation for the more 
properly desrriptivc one; “Topics in the theory of ai>i)roxi- 
mcation It is a l)rief essay in a field on which an encj’clopedia 
might be written. On the personal .side, it is an acconnt 
of certain aspects and ramifications of a problem to which 1 
was introduced at an (;arly stage, and which has given direction 
to my reading and study ever since. 

One day about twenty years ago 1 was admitted to the 
study of Professor Landau, seeking advice as to a subji'ct 
for a thesis. After some preliminary inquiries as to my 
experiejice and preferences, he handed me a long sheet of 
paper, and directed me to take notes as he enumerated some 
dozen or fifteen topics in various fields of analysis and number 
theory, with a few words of explanation of each. He told 
me to think about them for a few days, and to select one 
of them, or any other problem of my own choosing, Avith the 
single reservation that I should not prove Fermat’s theorem, 
an injunction which J have observed faithfully. Guided partly 
by natural inclination, perhaps, and partly by recollection 
of a course on methods of approximation which 1 had taken 
with Professor Bdcher a few years earlier, 1 committed myself 
to one of the topics which Landau had proposed, an investi- 
gation of the degree of approximation with which a given 
continuous function can be represented by a polynomial of 
giveh degree. When I reported my choice, he said meditative- 
ly, in words which I remember vividly in substance, if not 
perfectly as to idiom: “Das ist ein schOnes Thema, ich beneidc 
Sie um das Thema . . . Neiii, ich beiieide Sie nicht, aber es 
ist ein wunderschfines Thema!” It is in fact a problem which 
admits a surprising variety of interesting developments on 
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PREFACE 


its own account, and offei's a natural avenue of approach 
to a number of fields of still broader importance. 

Although delayed in its completion by the conflict of other 
duties, the following exposition is substantially in the form 
in which it was projected at the time of the Colloquium 
lectui-es in 1925, and presented in abstract in the lectures 
themselves. One section, on the vector analysis of function 
space, originally designed for inclusion in the Colloquium, 
has meanwhile been published separately instead. The sections 
which had been written at full length in September, 1925 
— practically the whole of the first chapter, parts of the 
second, and most of the third — have been left unchanged, 
except in minor details. The elementary account of Legendre 
series in Chapter I, for example, was written before the appear- 
ance of the admirable article on the subject by M. H. Stone 
in vol. 27 of the Annals of Mathematics. A few other articles 
published since 1925 are mentioned in the text. 

For the most part, however, citations of the literature have 
been omitted. The preparation of a really adequate bibliog- 
raphy would have been a task of such magnitude as to 
delay the publication indefinitely. References to some of 
the most important papers of not too recent date are con- 
tained iu my thesis (Gottingen, 1911) and in my report 
on TJie general theory of approximation by polynomials and 
trigonometric sums in vol. 27 of the Bulletin of the American 
Mathematical Society. Among publications in book form 
supplementing the material given here, mention should be 
made of Borel’s Legions sur les fonctions de variables reelles 
et les diveloppements en series de 2)olyfiomes, de la Vallde 
Poussin’s Leixms stir V approximation des fonctions d’une 
variable rfielle, and S. Bernstein’s Lemons sur les propriites 
extremales et la meilleiire approximation des fonctions analytiques 
d’une variable rtelle, all appearing in the Borel series. As 
to the content of these lectures themselves, there are many 
points where it would be difficult now to recall the original 
sources either of specific results and proofs or of suggestions 
as to method. To the extent that the work is my own, some 



PREFACE 


V 


parts have been published previously, in my thesis, in various 
articles in the Transactions of the American Mathematical 
Society, and elsewhere; other parts are now offered in print 
for the first time. Numerous detailed acknowledgments, not 
repeated here, have been made in the pages of the earlier 
publications. In connection with Chapter IV, reference should 
still be made to the work of Faber on trigonometric inter- 
polation in his memoir tlher stetige Funktionen (zweite Ah- 
handlting) in vol. 69 of the Mathematische Annalen. My 
acquaintance with the statistical formulas discussed in Chapter V. 
which might have come from any of a variety of sources, was 
in fact mostly obtained from Yule’s Introductim to the The- 
ory of Statistics. The lemma on which the method of Chapter III 
depends is derived from the most important single memoir in 
the literature on degj-ee of approximation, S. Bernstein's epoch- 
making prize essay of 1912, with which the present work also 
has other points of contact. And in conclusion it should be 
said that my study of the problem has been dominated from 
the beginning not only by the influence of my own teachers, 
but also by the writings of Lebesgue and de la Vallee Poussin. 

October 1, 1929 


Dunham Jackson 
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CHAPTER I 


CONTINUOUS FUNCTIONS 

Introduction 

Weierstrass first enunciated the theorem that an arbitrary 
continuous function can be approximately represented by a 
polynomial with any assigned degree of accuracy. The 
theorem may be stated with precision in the following form: 

If f(x) is a given function, contimious for (i ^ ^ ^ unci 
if € is a given positive quantity, it is always 2 )ossible to define 
a polynomial P{x) such that 

\f{x)-P{x)\<e 

for a < X ^ b. 

To Weierstrass is due also the corresponding theorem on 
approximation by means of trigonometric sums: 

If ^ given function of period 2 tt, continuous for 

all real values of x, and if e is a given positive quant ity^ it 
is always possible to define a trigonometric sum T [x) such that 

\f{x)-T{x)\<e 
for all real values of x. 

By a polynomial is meant an expression of the form 
aQ-\- ai X aoo? • • • + 

This expression will be said to represent a polynomial of 
the nth degree, not only when an is different fiom zero, but, 
in distinction from the usage which prevails in some parts 
of algebra, also when Un = 0. That is to say, the words 
'^polynomial of the nth degree” will be used in place of the 
longer expression "polynomial of the nth degree at most”. 
Even the case of identical vanishing is not excluded. A trig- 
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THE THEORY OF APPROXIMATION 


onometric mm, or more specifically a trigonometric sum of 
the nth order, is an expression of the form 

cos .r + ^2 cos 2./*+ +anCOsrta: 
fhi sin xfho sin 2./'f ••• fhn sin nx. 

The definition is inclusive once more; the simultaneous vanish- 
ing of Oa and hn is not ruled out. 

These two types of approximating function show a persistent 
and fundamental similarity in their behavior, on which 
differences of more or less significance are from time to time 
superimposed. Simplicity of statement and proof will favor 
sometimes one and sometimes the other. 

It is readily seen that the number of terms required to 
yield a specified degree of approximation, or, under the 
converse aspect, the degree of approximation attainable with 
a specified number of terms, will be related to the properties 
of continuity of f{x). It is the purpose of the next pages 
to trace out this relationship in some detail. 

I. Approximation by trigonometric sums 

For a considerable body of results, the following theorem 
may be regarded as fundamental: 

Theohp:m L IfJ\x) is a function of 'period 2n, such that 

L/ C^’2) ./ ('^1) 1 ' X2 Xi I 

for all real values of andx 2 , ^ being a constant, there will 
exist for every positive integral value of n a trigonometric 
sum Tn{x), of the nth order, such that, for all real values of x, 

:/(x)-7’„(x)| < , 

n 

where K is an absolute const aiit, depending neither on x, nor 
on n, nor on A, nor on any further specification with regard 
to the function f{x)- 

In the proof of the theorem, use will be made of the 
following 
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Lemma. If m is a positive integer ^ the expression 

sin^ (inxl2) 
sm^{x/2) 

is a trigonometric sum in x, of order 2 m — 2 . 

Because of the identity 

cosp^ccosg^r = -~[cos( 2 > + ^)^4- cos — q)x] 

and the others of similar type, it is seen at once that the 
product of two trigonometric sums, of orders 72, and 
respectively, is a trigonometric sum of order ti, -j- • It is 

sufficient for the purpose in hand, therefore, to recall any 
one of the numerous proofs of the well-known fact that 

sin®(7wa:/2) 1 — costnx 

sin*(a;/2) 1 — cosr 

is a trigonometric sum of order m — 1; its square will then 
be a sum of order 2m — 2. The fact that 1 — cosmrr is 
equal to the product of 1 — cosx by a trigonometric sum of 
order tn — 1 appears, for example, from the identities 

m— 1 

1 — cosm^r == ^ [cos^^a: — cos(|7 4^ l)x‘], 

2 ) (I 

cos px — co^(p + l)a; 

= (1 — cosa:) — ^ [cos(^ — l)a: — 2cosg'a' 

+ cos(<7 + 1)*^]. 

cos(g — l)a: — 2 cos^a; + cos(g4- 1)^‘ 

[cos(g — l)a; + cosf^ + 1)^’] — 2 cosga^ 
= 2 cosqx cosa: — 2cosqx 
= — 2cos^jfa:(l — cosa;). 

To proceed with the proof of the theorem, let 
Fm{u) = = hmJ^^^^^f{^-{''2u)Fm(ti)dUy 


1 
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THE THEORY OP APPROXIMATION 


where m is any positive integer, and hm is defined by the 
equation 



Bj' means of the substitution x-\-2u = v, the expression 
for lm{x) is transformed into 

Both factors in the last integrand have the period 27t with 
regard to v, so that the value of the integral is unchanged 
if the interval of integration is replaced by any other interval 
of length 2 71. In particular, 

Im{x) = Y hm. /(v) Fm ^)] ’ 

The expression Fm[hiv—x)], by the Lemma above, is a 
trigonometric sum of order 2 m — 2 in {v — x), and may be 
regarded as a trigonometric sum of the same order in x. 
with coefficients which are trigonometric functions of v. The 
whole integrand is a trigonometric sum in x with coefficients 
which are continuous functions of Vj and Im{x) therefore is 
a trigonometric sum of order 2 m — 2 in with constant co- 
efficients. The proof that this sum is an approximate rep- 
resentation of f{x)j when m is large, will be based on the 
original representation of Im ix). 

Let the equation defining hm be multiplied by hmf{x). 
Since f{x) is a constant as far as u is concerned, it may be 
placed under the sign of integration, so that 

fix) = hmj^^i^ fix) Fm ill) du. 

Consequently 

Imix)—fix) = hm^^^^[fix-{-2u)—fix)]Fmiu)du. 

By the hypothesis imposed on fix), 

\fix-\r2u')—fix)\ 2i.\u\. 
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Hence 


or, since Fm{n) and \u\Fm(u) are even functions of u, 

r /2 

uFmi'u)du 


\Im(x)—f{x)\^^lhm uFm{u)du^-2X 


To anticipate the conclusion of the proof, let 


r*7i/2 

Fmiii)du 


sin^j 


dty 


C2 


^ sin' 

Jo 


t . 

TT" dt. 


These quantities are merely numerical constants. Is is clear 
that each integi'and approaches a limit for ^ — 0, and that 
the improper integral defining c# is convergent. 

By the use of the fact that 0 < sin u < u for 0 < u < 7r/2, 
and the substitution mu t, it k recognized that 




1 gii^ 

m Jo 
m 


On the other hand, (sinu)/?^ decreases monotonically as u 
goes from 0 to 7r/2, so that 

sin?/ ^ sin(7r/2) 2 1 tt 1 

u " (7r/2) 71 ' m\u 2 u 

throughout the interior of this interval. Hence 



From these relations it follows that 


IrAx)-f{x)\ 


71 ^ Cg ^ 

S(\ m 
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Now let n be an arbitrary integer, and let m be taken 
equal to ^w+1 or according as n is even or odd. 

In either case, 2»i — 2<n<2Mi. Let the corresponding 
expression Imix) be denoted by Tn{x). Then Tn{x) is a 
trigonometric sum of the nth order (it will be remembered 
that this is understood to mean of the nth order at mmt. 
according to the more usual terminology), and, since \lm-^2/n, 




It* Cj A 
4ri 


7v;. 

'll 


if K is tsiken equal to -T*r»/(4c,). Thus the proof of the 
theorem is completed. 

So much has been conceded to simplicity of outline, in 
building up the above inequalities, that the final upper limits 
are quite unnecessarily large, giving little indication of the 
actual magnitude of the quantities that precede. It will 
add a little to the definiteness of the conclusion to point 
out that ci>(2/jr)*. since (sin<)/0 2/7r throughout the 
interior of the interval of integi'ation. while 



With more attention to detail, however, the estimate can be 
cut very much closer. The theorem is actually true with 
K — instead of the value adopted above, or even with 
a somewhat smaller value of K. On the other hand, it can 
be shown that the statement is not generally true with a value 
of K smaller than 7x12. 

To pass on to a more general theorem, let /(*) be an 
arbitrary continuous function of period 27 r, and let w(d) be 
the maximum of l/(a^*)--/(a;,) ! for |a:g— x, | < d. The 
function w(d) has been called by de la Vallee Poussin the 
modidm of voxitinuity of /(.r). With the word maximum 
replaced by hast upper hmmd, it can be defined for any 
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bounded function, whether continuous or not. The character- 
istic property of a uniforml}" continuous function is that 
~ 0 . 

Let fp{x) be the continuous function of period 2tc which 
takes on the same values as/(a^) at the points 


271 

n 


- 71 -^- 


47r 


n 


71- 


27T 




and is linear from each point of this set to the next. The 
p:raph of y(x) is a broken line, no segment of which has 
a slope greater than (Mi{27thi)l {27t In) in absolute value. 
In analytical language, y (x) satisfies the hypothesis of 
Theorem 1, with 

P _ CO (271 In) 

27iln 

For every positive integral value of n^ therefore, there is 
a trigonometric sum Tn(x), of the nth order, such that 

\<p(x)-Tn(x)] < 

On the other hand, any specified value of x differs by less 
than 27r/n from one of those for which / and f are by de- 
finition equal to each other; neither f{x)nor^{x) can differ 
by more than ft)(2 7r/n) from the corresponding common value; 
and hence 

\/{x) — f{x)\ ^ 

for all values of x. If the quantity Kl {2 7x)-\-2 is denoted 
by the last two inequalities may be combined to yield 
the following statement: 

Theorem II. If f{x) is a continuous function of period 
2rtj ivitli modulus of continuity co{S)^ ihei'e exists for every 
positive integral value of n a trigonometric sum Tn(x), of 
the, nth order ^ such thatj for all real values of .r, 

\f{x)-T„{x)\ < K'io 
ivhere K' is an absolute constant. 
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While this theorem is applicable to any continuous function, 
it involves the modulus of continuity in the inequality which 
forms the essence of its conclusion. It can be shown that 
the assignment of an outer limit of error for an arbitrary 
continuous function, without some dependence on properties 
of the function beyond the mere fact of its continuity, is 
impossible. 

Since limn=oo w(2 jr/n) = 0, it is to be noted that 
Theorem II includes one of the theorems of Weierstrass to 
which reference was made in the opening lines of the chapter. 

In preparation for the next developments, there is occasion 
to examine more closely the proof that was given above for 
Theorem I. It will be recalled that to an arbitrary positive 
integer n a second positive integer m was assigned, in terms 
of which a function Fm 00 was constructed ; and a trigono- 
metric sum TnCr), yielding an approximate representation 
of the given function /(x), was defined as equal to an ex- 
pression which could be reduced to the form 






hm being independent of x. A lemma stated essentially that 
Fm is a trigonometric sum in ?e, of order 2m — 2 ^ n. 
It is possible therefore to write ~ the form 

1 '' 

Ano + 2 [Ank COS k {v — x) + Bnk siii k {v — j )] . 

/£=1 

When the above expression for Tn{x) is expanded as a trig- 
onometric sum in Xj the constant term is 



and is zero if the last integral vanishes, an observation which 
will presently be important, for the reason that the indefinite 
integral of a trigonometric sum without constant term is 
itself a trigonometric sum, while this is not the case if the 
sum to be integrated has a constant term different from zero. 
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It may be pointed out in this connection, though it is not 
essential to the main argument, that the coefficients Bnk arc 
all zero. This can be inferred from an elementary theorem 
on trigonometric sums, since F.m{\u) is an even function of u, 
and is also directly apparent on inspection of the proof of 
the lemma. If are the Fourier coefficients of/(x): 

rtfc = ' J* cos kv di), hk = 

and if .l„k is denoted by d„k, it is seen that 

1 " 

Tn (.x) ^ 'n rfiio flo + 21/ dtOi. {uk cos kj' + h- sin kj) . 

^ k 1 

As the (Vs are independent of the function to be represented, 
the calculation of the successive expressions T„(r) amounts 
to a method of summation of the Fourier series for /(x). 

To return from the digression of the last paragraph, let 
f{x) be a function of period 27r, which has everywhere 
a continuous derivative /'(x). For a particular value of «, 
let tn(x) be a trigonometric sum of the nth order, without 
constant term; 

Jt 

f'n (x) - 2 {ai, cos kx + fik sin k.r ) , 

k 1 

and let be a constant such that 

/(a’) — >. f-ii 

for all values of .r. fiCt fnU) be the trigonometric sum, 
without constant term, which has /n(.r) for its derivative: 

^ sin/i\r — cos /,•./•) , 

and let Vnix) — /(.r) — tn{x). Then has the period 27 t, 
andj since lrn(.r)| satisfies the conditions imposed onf{x) 

in the hypothesis of Theorem I, trith A = fn. Hence there 
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exists a trigonometric sum of the «tli order, which may be 
denoted bj'^ r„(.r). such that 

\ in {■»')— r„{x) - . 

n 

If + then /(a:) — T„(a) - r„ (a-) — '»(./), 

and 

!/(,r) -!’„(*) I < 

From the existence of an approximation fory'Cr) it has been 
possible to draw an important inference with regard to the 
approximation of f(x). If f(.r) is itself the derivative of 
a function of period 2n, so that the integral of fix) over 
an interval of length 2 /t is zero, it follows that 

( r„i.r)dx -- ( /(a)rfa — f tni->)dx = 0. 

pJ — Jl pJ—^ pj—7l 

whence, according to the second paragraph preceding, the 
sum Ta ix) given by the proof of Theorem I as an approximation 
for Knix) will have no constant term. So the constant term 
in the present Tn {.'■), defined in terms of this T„(.r), will be 
zero likewise. 

The way has now been prepared for a demonstration of 
Thf.orkm III. If fix) is (I fimetim of period 2n, having 
a pth derirntivc f^-^'^ix) such that 


\f^P\xi)—fJ’>ixi)\ < i|rs-~-.rii 

for fill real values of j\ and A being a constant, thei'e mil 
exist for every positive integral value of n a trigonometric stim 
Tni'i), of the nth (U'der, such that, for all real valuer of x, 

\fix)-T..ix)\ . 

where K is the absolute constant found in the proof of 
Iheorem 1. 

It is to be noticed that the ar^ment is based on the ex- 
plicit construction of the approximating sum in Theorem I, 
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and it is not clear that a smaller value of K in that theorem, 
justified by some different method, would necessarily be 
applicable here. The value A' = 3, previously mentioned 
in connection with Theorem I, results from the same explicit 
construction, and is available in Theorem III. 

By Theorem I itself, there exists a sum Tn\{x) such that 


Kl 

' ' n ' 

On the basis of the more recent discussion, as 


X 


(rr)! 

he more recent ( 

_f^P'>{x)d:v — y'F (tt) — 


- 0 , 


it may be understood that T„i (x) has no constant term. 
Since is the derivative of the periodic function 

furthermore, an approximating sum I'niix) may 
be constnicted for as indicated above, with 

e„ ~ KX/n. and 

I < 

n 

If > 1, (a;) is itself the derivative of the periodic 
function and the constant term in Tn'>{oi:) is zero. 
By a sufficient number of repetitions of the process, the 
theorem is established. 

In Theorem 11^ even if the inte^'al of / (x) over an inter- 
val of length 2 71 is zero, the same thing is not necessarily 
true of the auxiliary function ^(x), and it is not clear that 
the approximating sum in the conclusion of the theorem will 
lack the constant term. The difficulty is not a serious one, 
however. If 

(f'(x)dx = r, 

tf—7l 

let f/i (.r) (p{x) — c/{27i). Then 



- 0, 


and there is a trigonometric sum 7^1 (.r), without constant 
term, such that 
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But on the hypothesis that the integral of fdx over a period 
is zero, 

I 

kl = dx 




< 27r . 2ft) 



and 


5P(a:) — 9>i (a^)l ^ — ^ 4a) • 


So the conclusion of Theorem II applies to the approximate 
representation of fix) by a trigonometric sum without constant 
term, when f{x) is the derivative of a periodic function, on 
the condition merely that K' Ki{2n)-\-2 be replaced by 
K'' = KI{27i) + 4r, 

The same process of induction which was used to prove 
Theorem UI then serves to establish 

Theorem IV. If /(a*) is a ftmdioyi of period 27i ivliich 
has evei'ywhere a continuous pth derivative, with modulus of 
continuity ft)(d), there exists for every positive integral value 
ofna trigonometric sum Tn{x), of the nth order, such that, 
for all real values of x, 


\f{.c)-Tn{x) 


K” KP 


n'P 



where K is the absolute constant given hy the proof of Theorem I, 
and K" = K/{27 i) + 4:, 

A part of the content of this theorem may be restated 
in the following 

Corollary. If fix) is a function of period 2 n which has 
everyivhere a continuous pth derivative, there exists for eveft'y 
positive integral value ofna trigonometric sum Tn(x), of the 
nth order, such that 

lim nP --- 0, 

n = oo 

if e,i is the maximum of \f{x) — Tn{x)\. 
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2. Approximation by polynomials 

The development of the theory of approximation by trigono- 
metric sums will be interrupted at this stage, to make way 
for a presentation of the corresponding results with regard 
to polynomial approximation. The transition will be aided, 
liowever, by one more lemma on the trigonometric side. 

Lemma. If five) is an even function of period 2n. and 
if there is a trujonometric sum of the nth order ^ such 

that \f(x) -- Tnix) 1 e for all values of Xj there exists a cosine 
sum Cn(x) of the same order (that is, a trigonometric sum 
without sine terms), such that, for all values of x, 

\f(x)~CAT)\ - 6. 

When fix) is even, the approximating sum given by the 
proof of Theorem I will automatically lack the sine terms, 
as an immediate consequence of the fact, already pointed 
out, that its definition is equivalent to a method of summation 
of the Fourier series; and this observation would be sufficient 
for the main argument; but it is of interest to note that the 
lemma subsists independently of any particular mode of con- 
struction of the original approximating function. 

For the proof, let . 

Onir) = ~[Z’„(a-)+r„(-,r)]. 

Then Cnix) consists merely of the cosine terms of Tn{x). 
without the sine terms. On the other hand, since /(:r) is 
even, 

fix) = Y [/M+ZC— a;)], 

and therefore 

I fix) — Cn ix) I 2 [fix) — T„ (a;)] + [/(—a-) — Tni— .r)] j 

< €, 

Now let fix) be a function defined for — 1 ^ 1 ? and 

subject to the condition 

\fixi)—fixi)\ < X\xi — Xi\ 
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throughout this interval. Let 

X = cose, fix) = f (cose) = <pie). 


Then is an even function, defined for all real values 
of 6, and 

|y(«s,) — 9 >( 0 ,)| == |/(cose 2 ) — /(cosfli)i 

< A looses — COS<>i| ^ |. 


By Theorem I, together with the lemma just proved, there 
exists a cosine sum Cn{S), of the nth order, such that 


W(0) - Cn (0)\ 


KX 

n 


But a cosine sum of the nth order in 0 is a polynomial of 
the nth degree in cos^, which may be denoted by Fnix)^ 
and the conclusion is that a polynomial Pn CO exists such 
that 

\fix)~Pnix)\ 

for — 1 ^ X < 1. 

If the interval ( — 1, 1) in the hypothesis is replaced by 
an arbitrary interval (a, 6), a preliminary transformation of 
variable may be made according to the formulas 


u 


2 X — a — h 
b — a 


fi^) f\ iy), 


whereby S\{y) is defined for ^ 1 y < 1, and 


lyiCyj) — yi iv\)\ 

= \fix^ — I = 2 ^ ~ '\y2 — yi\- 

The result just obtained may then be applied to the approx- 
imation of fiiy) by a polynomial in y, which is at the same 
time a polynomial in x. The general conclusion may be 
formulated as 

Theorem V. If f{x) satisfes ihr rondUion 


\f{oi'-i)~f{Xi)\ < Ai,rs — .aI 
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ihroHghoHt the dosed interoal {a, h), of lenigth I, there exists 
for every positive integral value of n a polynomial Pn{oc), of 
the nth degree, such that 


n 


for a < X h, with L — lA", where K is the constant of 
Theorem I. 

If the smallest i)()ssible value of K were found in Theorem 1, 
it is not clear that ^ K would then be the smallest admissible 
value of L, but it can be shown that the validity of Theorem V 
is not general for any Jj smaller than i. 

More generally still, suppose that f{vi.) is an arbitrary con- 
tinuous function for a <x <.b, and let w(d) be its modulus 
of continuity in this interval. AVith b — a ~ /, let be 
the continuous function which takes on the same values as 
f(x) at the points 


1 ^ ^ 

n’ 


, h, 
n 


and is linear from each point of this set to the next. The 
function y(x), having a broken line for its graph, satisfies 
the hypothesis of Theorem V, with 

j _ w{l/n) 
l/u ' 

while 

'/(•<•) — 7' W I r- 2o}{lln) 

throughout (a, h). There is a polynomial 7 « (x) such that 
I V (*) — (r) i ^ Li>)Qln)\ 

setting L-\-2 --- L', one may state 
TiiicOKiCM VI. If f{x) is a cmithmous function with mod- 
ulus of continuity w(d) in the closed inter cal {a, h), of length 
I, there exists for every positive integral value of n a poly- 
nomial Pn {■»'), of the nth degree, such that, for a < .r < h, 

|/(a)-P„U)| < rfwQ/n), 

where IJ is an absolute constant. 
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This incident all}^ includes the theorem of Weierstrass on 
polynomial approximation, which was quoted at the beginning 
of the chapter. 

The proof of a theorem corresponding to Theorem III is 
simplified by the fact that the indefinite integral of a poly- 
nomial is always a polynomial, so that special considerations 
analogous to those relating to the constant term in the trig- 
onometric case are not needed. A new complication is 
introduced, on the other hand, by the circumstance that the 
degree of a polynomial is raised by integration, while the 
order of a trigonometric sum remains unchanged. 

Suppose that f{x) has a continuous derivative /' (x) for 
c( <x < by and that there is a polynomial of degree 

n — 1, such that 

\f'{x)—pn{x)\ ^ 

throughout the interval. Let 

f*x 

p'nix)dX — Pn{x), f{x)—p„{x) = (x). 

Since | r'n (x) | < Vn {x) satisfies the hypothesis of Theorem V, 
with A — There is consequently a polynomial tt,, (x), 
of the nth degree, such that 

\rnix) — n„(T)\ <; f . 

If Pn (x) + rtn (x) = F„ (x), this Fn (x) is a polynomial of the 
Mth degree, and 

T 1 € 

\f(x) — Fn{x)\ == \rn{x) — nn{x)\ <, - — " . 

Let f{x) have a pth derivative satisfying the 

condition that 

|/‘P>(a's)— /<P'(a;,)| S l\Xi—Xi\ 


throughout ia,h). It n—p>0, there is a polynomial of 
degree n — p which differs from fiy more than 
LlX/(n — p) throughout the interval. There is then, by the 
preceding paragraph, a polynomial of degree n — 2> + l, 
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differing by not more than — + from 

and so on. Finally a polynomial Pn{x) is obtained, 
of the wth degree, for which 


\f(x)— Pn{x) i 


in — p)(n — p + l) ••• w ’ 


For the applications, the existence of a constant in the 
right-hand member of tlie last relation is far more important 
than any close estimate of its numerical value. It is most 
convenient, even at considerable unnecessary expense nume- 
rically, to be satisfied with the observation that 

1 _ 'ti ji 

(n — p){n — 7 ;Hhl)-*-w — 2^ + 1 

n — 1 p\ 

for 7i>p-\-l, and to state the result in the form of 
Theohkm VII. If f{x) has a pfh derivative f^P^{x) saiisfyinij 
the vondiiion that 


fhrowihout the dosed interval (a, h), of length /, there exists 
for every integral value of n>p a polynomial Pn{x)\f the 
nth degree, ffuch that for a<xi^h. 


!/(./•) -P„(.7’)! 


Lpi^[^ i. 


ivhi'ii' Lp ly\, and L is the constant of 

Theorem V. 

It is elesu' that even with h 0 the hypothesis implies 
nothing whatever as to the possibility of approximating f{r) 
by a polynomial of degree lower than p, since f(x) itself 
may then be any polynomial of the pth degree. By suitable 
changes in fomnlation it would be possible, though of 
secondai’y interest, to admit the value n 0 in Theorem V 
(or Theorem I), and the value n — p here. 

From Theorem VI, by reasoning similar to the above, one 
may deduce 
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Theorem VIII. If f{x) has a continuous 2 )th derivative with 
modulus of continuity throughout the dosed interval {a, fcj. 
of length I, there exists for every integral value of w>p 
a polynomial Pn (x)y of the nth de^reCy such that for a ^ x > fc, 

where Xp — {L-\‘2)jp\y and L is the conslunf 

of Theorem V. 

Corollary. If /(jc) has a continuous j)th derivative for 
a <x ^b, there exists for every positive integral value of n 
a polynomial Pn {x)y of the nth degree, such that 

lim “ 0, 

n=oo 

tvhete €,1 is the maximum of fix ) — Pn{x)\ in the interval 
{a, h). 

The exceptional status of the values ot n^p has no 
significance for the corollary, Avhich is concemed only with 
a limit for n — <x>. 

3. Degree of convergence of Fourier series 

The preceding theorems can be made to serve as basis 
for a discussion of the convergence and rapidity of convergence 
of Fourier and Legendre series, as well as of other processes 
of approximation. This idea will be developed more fully in 
succeeding chapters; its first consequences will be presented here. 

With regard to the Fourier series for a given function fix), 
it will be premised merely that it is a series of the form 

00 

2 («fc cos kx + bk sin Irx), 
ill which the coefficients have the values 

Uk ~ ^ \ fiOcoi^ktdt, bk ^ ^ I fit)fimktdt, 

7t tj—7i ft — n 

the formula for being applicable when k = 0, as well 
as when k is positive. The expression 
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Sn (x) — ^ COS kx + bjc sin kx) 

J fc— 1 

will be called the ‘^partial sum of the series to terms of the 
wth order/’ 

If the quantity 

^ -4-cosm + cos2m4- ••• +cosn?/ 


is multiplied by ‘isin^te, the product may be rewritten as 


sin “ + [si 


3u 
sm -g- 


sin 

• . • + [sin |n 4- 2 ) " “ (” “ |") ■ 


which immediately reduces to sin(w + i)?f, so that 

1 I I I I sin (?i -f- i) i( 

- -f- cos U cos 2 U + • • . + cos 7l 71 - . 

2 2 sin i ii 

Let each coefficient in Sn(x) be replaced by the integral 
expression which defines it. Since cosA*^^’ and sinkx are 
independent of /, they may be written under the sign of 
integration, and the various integrals may be combined into 
a single one: 


(x) — ^ ['2 ^ ^ ^ I ^ 




si n(?i + (t — x) 
2 sin i {t — .t) 


(If. 


From the last expression may be deduced the following: 
Lemma. If fix), of period 2n, is homided and infegrahte 
(in the sense of Riemann or in the sense of Tjebesgue), if 


[fix)\ < M 


for all values of x, aiid if 8n ix) is the partial sum of fhr 
Fouri(T series for fix), to ta'vns of the nth order, then 
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SnixY CMlogn, 


fo7' all values of x and for all values of n7''l, wliere C is 
an absolute constant, depending neithen* on x, nor on w, nor 
on the function f{x). 

By the hypothesis on /(./•), 



sin (n + i) 

2 sin i — X) 


dt. 


If one-half the integral on the right is denoted by jn, its 
form may be modified by the substitution u — ^(t—x), and 
by recognition of the fact that the resulting integrand is an 
even function of u, of period tt, to yield the conclusion that 


Jn — 

1 pni 2 I sin (2u + l)u 



sin (2« + l)tt 

J— (7r4-a?)/2 

sinu 


da 


1 nnl2 

2 J -71/2 I 


sin u 


du 


1*71/2 sin(2n + l)M 


Jo 


sin u 


du f 


!S„(x}l < 


From the fact that 

„ 1 + 2 

CITI 


SIHM 


k 1 


it follows that 


sin (2n + 1 )m 


smu 


^ 2« + 1 


for all values of h. On the other hand, |sin (2«+ l)«i ^ Ij 
while 

sinu ^ sin {nl2 ) 2 < 'T J_ 

u nr/2 7T ’ sin it ^ 2 n 


throughout the interval of integration. 8o 
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The last expression does not exceed a constant multiple of 
log », for « ^ 2, and the conclusion of the lemma is justified. 

The method of application of the lemma, which is due to 
Lebesgue (though he did not make so extensive use of it as 
is done here), may be summarized in 
Theorem IX. If fix) is a tvntinnous functim of period 2 > r , 
and Sn(x) the partial sum of its Fourier series to terms of 
the nth order, n ^ ■ 1 , and if there exists a trigonometric sum 
Tnix), of the nth order, .such that 

|/(r)— Tnix)\ e„ 

for at I values of x, then, for all values of x, 
f{x) — iluix)\ ^ iff„l0gM, 

trhe}e li is an absolute constant. 

The statement is equally true, though of less interest, if 
fix) is merely assumed to be integrable; of course can 
not approach zero, when the relations arc considered for 
successive values of n , unless fix) is continuous. 

Let the sum T„(x) in the hypothesis have the expression 

U 

■^ + .2/ («k COX + tin sin ^3^)- 

It is found by direct integration that 

- Tn{t) co^kt (It = Of/., - r* Tn{f)^mkt (If = fik, 

n J—71 jT J—n 

for k so that the partial mm of the Fourier series Jor 
Tn{x)^ to terms of ordet' n, is identical with Tnipc) itself If 
the Fourier coefficients of /(r) are au, hk, those of the function 

Bn{x) =f{x)-Tn{x) 

are ak — eck, hk — fik, for A ^ w, and the partial sum 
of the Fourier series for En(x), to terms of order n, is 

Sn{x) — «o)+ 2 [{ok — f^k) coskx-\-{bk — A-)sin/fx] 

^ k= 1 

Sn(x)-Tn(x). 
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Since |i2nfrr)| < c„, it follows from the Lemma that 
Ia-„(j-)| ^ Cen log n. 

Consequently 

\f{x) - Sn (o:) \^\J{x)~ Tn (o;) I + I Tn (.r) - Sn {x) i 

= i ! + 1 Snix) | ^ + Ctn log n < -t' Csl log n 

for n ^ 2, and the last expression has the form B fn log n, 
with B = (log 2)“' + 6’. 

The theorem may immediately be specialized and made 
more definite by combination with Theorems I — IV, as follows: 
Corollary I. If 

!/fe)— /(a-i)l < X\xs—xi\ 

for all values of Xi and x^, ^ being a constani, then 

Corollary II. If f(x) is continuous with modulus of con- 
tinuity CO (d)^ 

i/(a::) — »S'„(a;)| ^ A o) ) log n. 

Corollary Ila. The Fourier series converges uniformly to 
the value f(x)y if f{x) has a modulus of continuity co{d) 
such that lim<f=^Qa)(d) log d = 0 (Lipschitz-Dini condition). 

Corollary ni. If f{x) has a yth derivative f^^^{x) such 
that 

I (x* ) (x, ) i ^ X\xt—Xi\ 

for all values of Xi and x^y ^ being a constant, then 

Corollary IV. If f{x) has everywhere a continuous pth 
derivative with modulus of continuity «(d), 

I /(x) — Sn (x) I < ^ « iffj log n . 
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In each of these statements, the conclnsion holds for all 
values of x, and for all values of n > 2; the coefficient A is 
an absolute constant, while Ap depends only on p. 

The corollaries are stated separately for emphasis;, it is 
clear that all are included in Corollary IV, if it is undei- 
stood that p may in particular have the value 0. The use 
of the same ^ in I and II, and of the same Ap in III and 
IV, signifies merely that when two constants are concerned, 
one symbol may be used to represent the larger of them. 

It is worthy of note that even the dependence of Ap on 
p can be eliminated, with a resulting simplification in Cor- 
ollaries III and IV which does not have a counterpart, as 
far as the present evidence goes, in the case of the corre- 
sponding third and fourth Theorems. 

If f{x) is a function of period 2 7r having a continuous 
derivative, the Fourier series for /(x) is that obtained by 
formal differentiation of the Fourier series for f{x). This 
is recognized without any further assumption as to the con- 
vergence of the series, from the relations 

J_^/' (0 ht dt — ]{ sin kt dtj 

(^71 nn 

f {i) m\ kt dt = — k f{t)cQs>ktdt. 

rj—n tj—n 

which are obtained by integration by parts, the terms which 
would appear outside the integral sign reducing to zero, 
because of the periodicity of the functions involved. If the 
coefficients in the series for f{x) once more are 6/tc, and 

if /(a:) has a continuous derivative of order 2q, where q is 

any positive integer, the Fourier coefficients for (x) are 

~ ( — 1)^ ofe, fik = ( — hky which means that the 

series for f(x) can be written in the form 

-~ + (— 1)^ ^ («/c cos kx + fik sin kx ) . 

The partial sums of the series for f{x) and for to 

terms of the wth order, may be represented by Sn{x) and 
(a:) respectively. 
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Let 


and suppose now that |$„ (a;)| ^ where fn-u < for all 
values of n that are considered, and lini«=jo «»»--• 0. Then 


aiicosA-.r + /?fcsin/.-a: i (a;) — (r) . 


By any of the preceding corollaries. JIa, for example, the 
series for fix) converges to the value of the function, while 
the series '^Qk-iix)!!^, ^ekix)/}^^ arc convergent, because 
I Qk (x) I is uniformly bounded. Hence the remainder /(x) — N„ (r> 
can be written and rearranged as follows: 

f(x)-~Sn{.l) = (—1)'^ ^ ^(atcos/.-3' + /Sksinfrj) 

k nfl 

---■ (— 1 )’ 2 L [efc-1 Cr) — (?fr (.r)] 

( W — (FFT)» ) ® ■ 


As the parentheses in the last summation are all positive, 
and as |^fc(ir)' ft ^ for k '' n. 


fix) — S„ (a:) I 




00 / 

”, ^ (-} 

k—nf\ \f 


2 fn 

(n 4' 1)^ ^ ’ 


1 


1 

ik+iy^i 




Under the hypothe.ses of Corollary III, letj) — 2q if p is even. 
]> — 2f/ + l if iJ is odd. Then Corollary 1 or Corollar3" II 1 
may be applied directly toy'=*«*(a-), which is found to satisfy the 
requirements of the preceding paragraph, with e,t (Aillogn)/)i 
in one case and f„ ^ U, A log n)/«* in the other. The 
corresponding values of 2e„ln^ are (2.U lognl/n^f ^ and 
(2 ill A log nVriP"'^ respectively. For literal accuracy, it must 
be admitted that (logw)/n diminishes only from n — 3 on, 
and that the conclusion has been established, when p is even. 
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only for n ^ 3 . The value n = 2 may be included if the 
previous is replaced by a somewhat larger quantity, with 
a suitable adjustment of the constant in the conclusion, if 
necessary. 

Similar reasoning may be employed under the conditions 
of Corollary IV, except that it is no longer satisfactory to 
take 2 q — when j? is even, as o){2n ln)\ogn might not 
decrease with increasing /?. It is sufficient, however, to let 
p ~ 2q-\-2 when p is even, p = 2 q-\rl when p is odd, and 
to obtain by the application of Corollary IV to 
as thus defined. Prom the definition of o)(d) it is certain 
that o) {2 71 / 71 ) itself diminishes, or at any rate does not 
increase, as n increases. 

All the cases in question are covered, with some redundancy, 
if the following are considered successively: for (f)(d)<Ad, 
p = 0, I, 2 q {q 'g: 1), 2(/4-l iq ^ 1); for general 
p = 0, 1,2, 2^ + 1 2^ + 2 (( 7 > 1 ). If a single letter 

is used to represent the largest of the finite number of con- 
stants entering into the corresponding conclusions, the result 
may be formulated thus: 

Theorem X, The precedmg Corollaries III and IV may be 
restated^ for all values o//?>0, ivith the mulfijdier Ap replaced 
by an absolute constant Z>, depending neither on p nor on 
anything else. 

4, Degree of convergence of Legendre series 

A considerable part of the above reasoning can be carried 
over to the case of Legendre series, though the relations 
are less simple than for Fourier series, and the results as 
presented here will be less complete. 

By the Legendre series for a given continuous function /(:r) 
is meant a series of the fonn 

^^0 A() (:r) -f- Xi (a:) U 2 Ag (a::) -[-••• 
where A^Or) is the Legendre polynomial of the A'th degree, and 

a, = 
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Not to work out the theory ot these polynomials from the 
beginning, it will be assumed as known that one of them 
is defined for each integral A'alue of Jc^O, the first two 
being Ao = 1, A\ - r; that they satisfy the relations 

f\ Xiit) = 0 (j t A-), ^ ; 

that any successive tliree of tlieni are connected by the 
recursion formula 

and that the polynomial of the A:th degree can be expressed 
in the form 

AfcC'/*) = - I [a’+ ^(l — cosy]* rfy . 

^ Jo 

In the last expression, the presence of imaginaries is only 
superficially apparent; if the integrand is expanded by the 
binomial theorem for a positive integral exponent, the co- 
efficient of each odd power of / after integration is an integral 
which is seen at once to be equal to zero. 

Let ASn(a) stand for the sum of the first w+1 terms of 
the series: 

iSn — ^^0 Ao (a;) + A'l (a*) + tin Xn (x ) . 

By the definition of the coefficients, 

Sa{x) t)df, 

where 

Kn U', t) ---- 2 [^0 (x) Ao (0 + 3 Aj (./•) Ai (<)+••• 

... +(2n+l)A„(a:)A„(0]. 
This function can be rewritten in the form 

IT / j\ M-l-l A>t4i(x)An(0 Afi(3j) A»-(-i(f) 

Knioc, f) - --g — ; 
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the identity is immediately verified for n ~ 0, and may then 
be proved in general by a straightforward process of induction, 
based on the recursion formula. 

In the integral expression for when j- 1. 

— aj^y^^cosyl — [aj*+(l — j;*) cos*y]'^* 

-< + - 1 , 

and hence | Xkix) \ ^ 1. There will be occasion to use a closer 
inequality for \Xk\ in the interior of the interval (—1. 1). 
To return to the integral representation, 

' Xk (.r) ■ - ( [a:* -i- ( 1 — ./■•) cos* >l </< 

^ Jo 

2 T"'* 

— [a* + (1 — a-*) cos* cl <f> . 

Ui) 

the last equality resulting from the fact that cos* v- = cos* (.r — r/.). 
Since 

./•* + (1 — a*) cos* y — - cos* <f> + X* .sin* y =— 1 — (1 — a™*) sin* . 


and since sin y ^ 2 throughout the interval of integration, 
while 1 — X* is positive for the values of ./• under considera- 
tion. it follows that 

a-*+(l-.>*)cos*y < 1— l-§*y*. 

jT 

if ? — 2(1 — x*y>'-ln. By an application of the extended 
mean value theorem to the function e~y, 


1— y+ 


y*e- 




so that e'J'r'l — y for all real values of y, and in the 
present connection 

l-?*y*--.e JV. 

So 


2 

TT Jo 
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and, by the substitution u — ky- 'i y , 

98/2 .. 

-Sa- (») I •> = /,.1/2^iL' "*)rr2 

where ti is independent of k and x; the numerical value 
r, -- (/r/2)'^ is not essential for present purposes. 

From the relation just obtained it follows, first, that if .r 
is restricted to an interval — I + i ? t > 1 — i »? , 0 ^ < 2 . 

then 

' Xkix) i > jil , . 

where g is independent of k and x, but depends on and 
secondly, that 

(I “.2)1/2 > 

where cg is independent of A*. 

It is possible now to proceed to the proof of the followin^>’ 
lemma, which assigns an upper bound for |ASn(ic)|, not through- 
out the entire interval — 1 < a? -- 1 , to be sure, but through- 
out an interval interior to it: 

Lemma. If f{x) is hoxmded and integrahiv (in the sense 
of Eiemann or in the sense of Lebesgue) /o?’ — 1 < :r‘ < 1. if 

i/(^)l ^ M 

throughout the interval, and if Sn(o') is the partial sum of 
the Legendre series for fix), (hen 

l/S-nCa:)! ^ GM\o%n 

for — 1 — 17 , 0 <//< 1 . where O does not depend 

on X, n, or the function fix), hut does depend on >/. 

The proof starts from the fact that 

i<S’„(a:)| < ^\Knix, t)\dt. 

Let the interval of integration be divided into five sub-inteiwals, 
terminated by the points 

1 1J_ 1 1 _L ^ 1 1 1 

-1, a:-- 1-y^, 1- 
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Under the hypothesis that — — tj, these points 

wUl succeed each other in the order named, as soon as 
it, >2/^. It will be assumed for the present that this con- 
dition is satisfied; then, in particular, wfO, and as there 
will be occasion to observe incidentally, [(w -f 2. 

Let the values of the integral of \Kn over the sub-intervals be 
denoted by 7i , •••, h respectively. The relation iZA(<)i < 
can be used in the second, third and fourth integrals, and 
the relation which is independent of t, in 

any of the five. In the middle interval. 


Knix, 01 - I i; \ (2/. + 1)Aa(./) A',,(oi 
2 ^a 6', 2 ^ ^ ^ /.-v-i *>-’ 

n 

- 1 + Z 2//^* 1 +2f)*n < (2i/*-!- 1) «. 

k I 

and hence 

h ---- I K„{j\ t) (If {2(f-+\)n(ff = 2(2//- -f II. 


The representation of t) as a fraction is to be used in 

the remaining integrals. In the first interval. ./■ - -0 ''^v. 
and 


r ^ 




L 



^ w + 1 

1 A„ 1 1 (x) 1 I 

'^X,Af).ilt 


e/ 

-1 

1 




-fLV„(.0|J_, A'„,i(Oh//| 

W + 1 1 
1 

L(w-t-i)''- J 

C.i 


n -1- 1 

2vig 


. 4c*(y 

^ V 
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In the same way, In the second interval, 


I Xn+l (X) X„ it) - Xn ix) J„+l it) I 

. ^-9 * --<r2(7* 

= 2 '[wCw+DF 

SO that 

J *x-lln nx—ifn j , 

-1 f ij U -1+ 2 ? ' 

or, by the substitution a — i = «, 



2g* (log 2 4- log «). 


Similarly, < 2//* (log 2 + log «). As n is an integer satis- 
fying the condition w>2/i|/"-2, it is certain that n > 3. 
logn>l, and the inequalities that have been obtained for 
/i , • • • , Jj will merely be strengthened if the factor log n 
is inserted in the right-hand members wherever it does not 
occur. By combination of these inequalities, 


£jA',.(r.oi 


d f S th log n 


for n7>2lfi, the number Gi depending only on For each 
value of n belonging to the range 2 < n < 2 / 1 /, the integral, 
considered as a function of x for — I + ^ ^ a; 1 — ^ > has 
a maximum value. Let 6'* be the largest of the finite 
number of maxima thus determined. Then Gg depends only 
on , and the statement of the lemma is true for all values 
of n > 2, if fr is taken as the larger of the numbers (?,. 
Gi/Iog 2. 

For the application of the lemma, it is to be noticed that 
an arbitrary polynomial of the «th degree can be expressed 
identically as a linear combination of Xoix), •••, An(x). 
with constant coefficients. Then a process of reasoning 
which is entirely analogous to that used in the proof of 
Theorem IX, and which need not be repeated at length, 
serves to demonstrate 
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Theorem XI. If f{x) is a cantinuotis function for 
— 1 ^ X ^ 1 , and J^n (x) the sum of the first w + 1 terms 
of its Legendre series, w > 1 , and if there exists a polynomial 
Pn (x), of the nth degree, such that 

|/(a;) — P„(.r)| tn 

for — 1 < a; < 1, then, for — 14*^7 ^ ^ ^ 1 — ^ (^>0), 

I / (x) — Sn (x) \ H €n log n , 

where H depends only on 

The more specific results obtained by combining this prop- 
osition with Theorems V— VIII will not be formulated sepa- 
rately, but will be summarized in a single 
Corollary, Iff{x) has a continuous pth derivative {p > 0) 
for — 1 % X 1, ivith modulus of continuity w(d), and if 
(x) is the sum of the first n 4’ I terms of the Legendre 
series for f{x), then 

for n > p and for — 1 4 ^ ^ 1 — Vi ivhere Hp de- 

pends on V and on p, hut not on anything else: in parti- 
cular, the series converges uniformly to the value f{x) for 
— 1+1? 1 — ij, if f(jc) iUelf has a modulus of con- 
tinuity «(d) for — 1 1. such that lim oi(d)logd.= 0 . 

(f=0 

Since // may be arbitrarily small, the last conclusion implies 
that the series is convergent, not necessarily uniformly, 
throughout the open interval — Ka*<l. 

Gronwall (Mathematische Annalen, vol. 74 (1 9 1 3), pp. 2 1 3-270 ; 
Transactions of the American Mathematical Society, vol. 15 
(1914), pp. 1-30) has shown essentially that 

j ^ Kn{x, t)\dt < 

for — where Go is an absolute constant. It follows 

that conclusions analogous to those of the preceding theorem 
and its corollary hold for the entire closed interval ( — 1, 1), 
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if the factor logw is replaced by the corresponding 
siifticient condition, for nnifom coiivergenee being that 
linirf, „ - - 0. Ihit the proof of the relation of in- 

equality for the integral appears to be rather long, and will 
not be set forth here. From the deflnition of K,,. together 
with the facts that |vYfc(j-); ;:i 1 for — 1 ■ ./• - I and that 
J Xk(f) df -■ it is obvious that 

J *i I « 

fit ■ M- y (2A + l)c. 

1 2 I: 1 

where ca is an absolute constant; and it can be infeired at 
once that the series converges uniformly to the value /(or) 
for — 1 ■ . 1 ' 1 . if fU) has a first derivative with a modulus 

of continuity o (6) such that lim,j- (, w(«l)/d''- ■ 0. while 

there are corresixuiding theorems on degree of convergence. 
In a later chapter, conditions will be obtained which are 
closer than those thus indicated, though not so close as the 
ones corresi»onding t(t the factor 

From the discussion of approximation in terms of polynomials 
and trigonometric sums it is natural to pass to similar questions 
with regard to dcveloiunents in series of more general functions, 
particularly the characteristic functions defined by linear differ- 
ential equations with boundary conditions. The beginnings 
<tf such a theory hav(‘ been ))resented by the author (Ti’ans- 
actions of the American Mathematical Society, vol. 15 (1914). 
pp. 439-46(1) and W. K. Milne (the same Transactions, vol. li> 
(191H). pj). 143 156). The present account, however, will 
be continued along other lines. 
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DISCONTINUOUS FUNCTIONS; FUNCTIONS OF LIMITED 
VARIATION; ARITHMETIC MEANS 

Introduction 

The discussion hitherto has been concerned almost entirely 
with uniform convergence, and with functions that are con- 
tinuous throughout the interval under consideration. Corres- 
ponding theorems with regard to the approximate represen- 
tation of discontinuous functions are naturally less simple, 
and perhaps of less immediate interest. It will be well, 
nevertheless, not to disregard entirely the question how far 
the theory that has been outlined can be brought to bear 
on the representation of such functions. This question will 
occupy the present chapter. (For a somewhat different set 
of theorems on the approximate rei)resentation of discontinuous 
functions, reference may be made to a paper by C. E. Wilder, 
in the Rendiconti del Circolo Matematico di Palermo, vol, 39 
(1915), pp. 345-361.) There will be occasion also for the 
further development of the theory as applied to continuous 
functions, by reference to the concept of limited variation, 
and by study of the summation of Fourier series according 
to the method of the first arithmetic mean. A beginning 
will be made by a review of some well-known general theorems 
about Fourier series. 

I. Convergence of Fourier series under hypothesis 
of continuity over part of a period 

Let J\x) be a function of period 2/r, which is summable 
over a period, together with its square. (Without change in 
the form of the argument, the discussion can be kept elementary 
for the present, with results which still have a high degree 
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of gcneralit}', if it is assumed instead that J\t) is boiuided 
and integrable in the sense of Bicmann.) Let 

1 n I 

<tk — I f{t) VOS ]{t dt, hk= I J\t) sin kt (it, 

■r J~ i J n 

H 

.V„(a) ^ -i- ^(«ftCos/fj' + t»fcsm/.-,T). 

- • k=l 

Thou, as may be verified by multiplying: out the square and 
integrating: term by term, 

U —n u ^ U-Ti 

— — f L/'(a;)]*<?3 f-*^ + 2 («t+ ^ic)] ■ 

nj-n I -S fc=l J 

As the first member can not be negative, it must be that 

f + i; (ai+ bi) < If m^)Vdx 

for all values of n, and hence that the series 

^ fc .1 

is convergent. It follows that for any function f{x) of the 
character specified, the coefficients an, bk approach zero 
as k becomes infinite. The identification of the sum of the 
series with the value (l/ar)J [/(a;)]* da: is not needed for 
present pui’poses. 

The assumption that [/(a:)]‘ is summable is not essential 
to the truth of the conclusion. On the hypothesis that /(a-) 
itself is summable, let /jv(®) be the function which is equal 
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to /(;/•) when |/(£c) | < N, and equal to 0 when \f(x) \ > N. 
Let aic, hk be the Fourier coefficients of f(x), and uhn, hs 
the corresponding Fourier coefficients of /jv(ar). It is known 
from the earlier work that limjt akN = 0, limuc^x hkx -- 0, 
for fixed .V, since /n (a") is bounded. Let e be an arbitrary 
positive quantity, and let N be taken so large that 

^ f l/(a;)— f. 

^ 0—7t J 

The hypothesis of summability implies that such a choice 
of iV is possible. Then 

7 J" ^ I [/W — /v (a)l sin J:x d.r ' t . 

for all values of k, and hence 

|ajc — nkN' ^ 2 i 

But for the particular value of A' in question there is a k,, 
such that Ifljicwl ^ I 5*^1 < for k ^ ko, and for 
values of k subject to the last condition it follows that 
1 Ufcl < f, j 6fc| < *'. If f{x) is a mmmable function of period 
2n, and if ak, bk are its Fourier coefficients, 


lim flfe - 0. lira hu — 0. 

/c = 00 fc-00 

Now let/(.r), still .summable and of period 2n, be supposed 
to vanish identically for xo — i) < x ^ j’o + 7, 

As was shown in the first chapter, 


Nn (j'o) 


1 

n 



sin (« + 

2 sin i (t — a-u) 


dt. 


Let t — j‘o If, inasmuch as the integral of a periodic 

function over a period is the same, wherever the initial 
point is taken, the limits — tt, tt may be retained after the 
substitution: 
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sin(«4-i)»* 

2sinit( 


du, 


or, after expansion of sin (nn + ^ 11 ) by the addition theorem, 


1 T’* 1 

•''’..(-To) = 2^ + ^ Hsmnudti 

1 C' 

^ 9 ,r j nudu. 

The factor cot \ u is bounded and continuous over the range 
of integration, outside the interval where f{xo + «) vanishes 
identically. Hence /(a^o + «) cot^M, like /(ajo + u) itself, 
is a summable function (or a bounded function integrable in 
the sense of Riemann, if this hypothesis was originall}’ im- 
posed on /(t)). and the reasoning of the preceding para- 
graph is directly applicable to show that both integrals in 
the last expression approach zero as « becomes infinite; 
that is. 

lim S\(xo) 0. 

/I— X 

More generally, if fix) and y(j) are two summable functions 
(or two functions satisfying the alternative hypothesis), if 
Snix) and «,,(./■) are their respective Fourier sums of the 
»tth order, and if f (x) and y (rr) are identically equal for 

Xo — >i ■ - X < x„-\-^, then 

lim [iV„ (./o) — «„ (r,,)] - 0; 

n--=oo 

if tha Fourier seyies for (p (x) ronverr/cs for x - ■ to the 
value ^(xo) ^ fM, the Fourier series for fix) does the 
same. This is commonly expressed by saying that the con- 
rergenre of the Fourier series for a giveyi function at a speci- 
fied point depends only on the hehavior of the function in the 
neighborhood of the point; it is tacitly understood that only 
functions of some specified class are considered, as, in the 
present connection, summable functions, or functions which 
are bounded and integrable. 
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It is possible then to pass immediately from ('orollary Ila 
of Theorem IX in the first chapter to the following statement : 

Theorem I. If f(x) i<t a mmmahle function of period 2 nr (or, 
less generally, a function of period 2 n which is bounded and 
integrable in the sense of Kiemann), and if there is an intei-- 
nal (xo — a:© + 7), 0<^<r7r, throtighmd which f(x) is 
continuous, with a modulus of continuity <0 (d) such that 
limj-^o w(d) log d = 0, theti the Fourier series for fix) con- 
verges for X -- a'o to the cahie f(xo). 

For a periodic function y(aj) can he defined as equal to 
fix) in (.To — fl, + Jind linear, say, from + to 
To — 7 + 2/r, and this <fix) will have a modulus of continuity 
satisfying the requirements of the Corollary cited. 

Attention will next be directed to the simplest case of 
convergence at a point of discontinuity, , Let ftoS) now he 
a function which has a ‘‘finite jump", or discontinuity of the 
first kind, at the point x — xo, approaching limits which may 
be denoted by /(.to — ) and/(To+) as x approaches to from 
the left and from the right respectively. Let it he supposed 
that the values of fix) in the interval ./o — < x~^Xo, to- 
gethw with the value /(xo — ), form a continuous function 
with modulus of continuity «,(d), and that /(.r) is likewise 
continuous, with the other limiting value at xo . in the interval 
from .fo to Xo-f-7, the modulus of continuity this time being 
o>s(d). For each value of d, let «(d) represent the lai^er 
of the numbers <»i, <«»*. It will be said briefly that fix) is 
continuous for Xo — 7 ^ x < .To+?. uith modulus of con- 
tinuity w(d), except for a finite jump at the point Xo. It is 
understood always that fix) is summahle over a period. If 
limj'^^ow(d)logd = 0 , Theorem I establishes the convergence 
of the Fourier series for fix) at all interior points of the 
interval (xo — xo + ll), other than .To, and it remains to 
consider the question of convei^ence at the point xo itself. 

For this purpose, it may be assumed without loss of gene- 
rality that Xo - 0 . For the terms of the series which involve 
coswx and sinnx, taken together, are identical with the 
corresponding terms in the development of fix) as a function 
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of X — :ro. In formulas, let x — Xo = y, f{x) -- 9 K 2 /), 

t — xq u. Then u — y t — r, and 


pTT p7r 

coswy J ^^ 9 )(?«)cosn^MZ// + sin7i7/ J ^(p{u) unmt du 

£ p7l 

^ (p (u) cos n {u — y) dn ^ \ ^f{t) cos n{t — x) dt 
^ cos n ^ J^^/(^) cos n ^ ^ ^ + sin nx J^^/(0 sin nt dt. 


The assumption that a:o = 0 is therefore equivalent to a change 
of variable which does not affect the conditions of the problem. 

With this understanding, let the symbols /(:ro+) i^nd /(^o — ) 
be replaced by /(0+) and /(O— ), let 

/i {^) = fix) (0 < ./• < vt) , /i (x) ^ J\ i—ji) ( — 7T .r < 0) , 

y;(0) =/(0+), 

and let f%ix) similarly be an even function identical with f{x) 
for — /r < a: <0, and taking on the value /(O — ) for = 0. 
Let Sniix) and Sn^ix) be the partial Fourier sums for /iCr) 
and fi(:x) respectively. The function fix) is continuous 
throughout the interval — f/ If :^i and X 2 are two 

numbers belonging to this interval, and if Us, — Xi ' < 6, 


if Xi and x^ have the same sign, or if one of them is zero, and 

I /x ix2) -f (x,) 1 < I f, (X2) -f (0) I + 1 /, (0) -yi (x,) i < 2 u>(d) 

if Xi and Xg have opposite signs. Similarly, f^ix) has a 
modulus of continuity < 2«(d) for — V ^ x < rj. If it is 
supposed that lim^^oft)(d)log5 ^ 0, 

lim Sni (0) -- f (0) = /(O +), lim Sn^ (0) A (0) = /(O -), 

n=oo n 00 


by Theorem I. But 
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■''-■'O'-iryw 


sin(w + ^M . 
2sini/ 



sinfw + iX j, 

— Hi — at, 

2 sin 1 1 


sm(n + i)< ,, 2 r® sin(w + i)< 

2sini< 2smi< 


(It, 


sin(^4 + i)/ 1 

■2sinJ/ “2 


[>s«i(0) 4'>^//2(0)], 


and therefore 

linwSUO) 4^[/(0+)+/(0-)]. 

W - 30 ^ 


The conclusion may be appended to the theorem as 

CoHOLLAHY I. //'./(.r) satisfies the conditioyis of Theorem I, 
except for a finite jump at the point x^, the series converges 

at to the value \[f ^ ■\-JXoco—)]. 

So far, nothing has been said about uniformity of con- 
vergence. To lead up to a discussion of this topic, the 

following lemma will be established: 

Lkmma 1. If f{x) is an arhitrary simmahle function, the 
integrals 

Ak(y) = f /(^) cos Bu{y) — f /(/) sin A/ eif /. 

approach zero, ask heconu^s infinite, anijormly for — 

If /(a) is bounded and integi’able in the sense of Riemann, 
the proof holds without change of form when the integrals 
are thought of as Riemann integrals. 

In the first place, it is clear that 

linifc. oo^JfcO/) = limfc ^Bk{y) -- 0 


for any fixed value of y in the interval. For Ak (yV^i and 
(.y)/^ iire the Fourier coefficients of a function W(x) which 
is equal to /(.r) for — rr < x y, and equal to 0 for 

y '.x < n. Let € he any positive, quantity. Let 

<11- 

This function is continuous for — ir ^ y < .t, and so 
uniformly continuous. Let d bo a positive number such that 
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if yi and y, are any two points in ( — n, n) for which 
12/8 — Let N be an integer, for definiteness the 

smallest integer, such that 2;r/iV<d, and let zj = — Jt-\-{^jnlN), 
j = Q,l, N. By the remark made above as to the 
behavior of Auiy) and Bk{y) for fixed y, there is for each 
value of y a number fcj such that <ie, \Bk{zj)\<\f. 

for k > kj. Let k' be the largest of the numbers ko, • k^. 

If y has any value in the interval ( — u:,n), there is a zj 
such that 0 < y — zj<S. Then, for any value of k, 

^Akiy) — Ak{zj)\ IJ^ fit)cosk(dt \fit)\dt 

= .4(2/) — 

liifc (2/) •)!<-*. 

If /■• :: //, 

Akiu)] < \Ak{zj)\-^\Ak{y)~ Akizj)\<ey \Bk{y)\<€. 

This is equivalent to the conclusion of the lemma. 

The existence of the period 2/^ for f{x) being understood 
throughout, let the definition of Akiy) and Bkiy) be extended 
to all real values of y. For any iji and i/g, 

py, 

f(t)cosht(lt -= Ak{yi) — Ak(yi). 

Here and subsequently, each relation written down for Ak 
has a counterpart involving Bk. Let €k be defined for each k 
as the larger of the maximum values attained by \Ak{y)\y 
\Bk(y)\in{—7t^7t). In particular, \Ak{n)\<€ky \ Bk (tt) | 

The lemma just proved is equivalent to the statement that 
limk^oo = 0. Because of the periodicity of f(t) cos kt 
and /(<) sin A:^, the integrals defining Ak(y) and Bkiy) are 
not altered in value if both limits of integration are increased 
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or diminished by the same integral multiple of 27r. For 
example, if ^ is in the interval (tf, Stt), 


\Au(y) — Ak{ji) I < €u, 

whence it follows that \Ak{y)\^2€)t in this interval. If yi 
and are anywhere in ( — n, S/r), the corresponding values 
of Ak will differ by not more than 4^^. Finally, if y^ and y^ are 
any two numbers subject to the condition that \ yi — yi\^2n^ 
there will be an integer j (positive, negative, or zero) such 
that yx-\-2jn and y%-\-2jn belong to the interval ( — tf, Stf), 
and 

\Ak(y^) — Aic{yx) \ ^ 4fA:. 

A similar relation holds for Bk. 

The purpose of these details is to bring out a fact which 
will be used presently. For any Xj and for any y in ( — tf, tf), 


7l-tX 


fit) sinn(^ — x)dt 


- I Q.()%nx\Bnix-\-y)—Bn{x — n;)\ — ^\iinx{Anix‘\-y)—Anix—7i )\ | 
' ^ 8 ^ n > 

or, by the substitution i — x ^ u, 


z 


f(x-i~u) sin wn du 


Sfn 


for arbitrary x and t/, subject to the condition that — ^ Sy 
^71, (It is clear that the interval from — tf to y may be 
replaced by any other interval of length <2n,) 

The next stage of the discussion may be summarized in 
another lemma: 

Lemma II. If f{x) is a summahle function which vanishes 
identically for a — + ^>0, its Fouriei' 

series converges uniformly to the value 0 for a <x ^ fi. 

The proof will be expressed in terms of the Lebesgue 
theory of integration, and, unlike those that have gone before, 
would have to be appreciably modified in form if reference 
to that theory were to be eliminated. A more elementary 
method of proof appropriate to the case of functions bounded 
and integrable in the sense of Riemann will be indicated later. 
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Let r have a value belonging to the interval te < ./• • - 
By the usual formula, 




1 /*’' I 

9 7 I /(Ocot „ (/ — .r) sinw(/— .r)rf/ 

+ ,r r /^O cos n(< — .r)rff. 


The second of the two terms making up the last ex))ression 
is equal to J(r/nCOsn.r h 6/, sin n.r). and approaches zero 
uniformly for all values of j-, as an immediate consequence 
of the fact that a„ and hn approach zero. 

In th(‘ first term, let f — ,#• -= u. Then the integral is 
equal to 

Jn^or) - ^ /*(./• -j- f() cot 2 sin n u d n. 


IjCt (\ii) be a function of period 2 .t which is equal to 
cot i ff for - ' rr V — rj and for // n tt, equal to 0 
for i ‘ ' n < i and so defined for — J? - n < — i i? and 
for hfj y rj as to have a continuous derivative every- 
where; it is scarcely necessary to write down an explicit 
formula for the construction. As is identically zero 

for — <11 when .r is in («. /®), the integral ./„(.r) is 
the same as 

-1- ») sin n ti d H . 

Let 

Jl^/(j' + «)sinwi((f» ^ F(j/); 

the values of n and t are for the moment to be regarded as 
fixed. The function F(y) has fTlmost everywhere a derivative 
equal to /(j;-l-j/)sinw?/. Consequently, as C(?() is differ- 
entiable everywhere, the function Fiu) C(u) has almost 
everywhere a derivative equal to F'{ii) F{u) C' {ii). 

Furthermore. Fiu) is absolutely continuous, and (7 (h) , having 
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a bounded derivative, is absolutely continuous also. Hence 
F(«) 6 ’(m) is absolutely continuous, and its increment over 
an interval is equal to the integral of its derivative over 
the interval. That is, in particular, 

F{iT)C{n)-F{—n)C{—.T) 

F' («) (,’(«) d ti + Fin) C (m) d u , 

Jnipr) —■ C(m) du 

-- F{n) (\jx) — F(— /r) Ct— 7i) — r F{u ) ( ” (h) d u . 

pJ—7T 

Hut C{7i) -- C{ — 7i) cot (-+ 12 ^) - 0. By an earlier 

paragraph, for -/r ^ ti < n, where £,t is in- 

dependent of vt; if is the maximum of the continuous func- 
tion |r'(u)|, which is independent of ./■ and 7i, 

\Jn(or)\ — ^F{ti) (y{n) du\^ < 

So Jn{^) api)roachos zero uniformly for the values of oc in 
question, Sn{x) does likewise, and the lemma is proved. 

It follows at once that if f{x) and y {x) are two summable 
functions, identically equal for « — + aiid if 

the Fourier series for y (x) converges uniformly to the value 
y(x) -^f{x) for jS, the series for f{x) does the same. 

In particular. Theorem 1 may be further supplemented by 
Corollary 11. If f{x) 2 s a summable function of period 2n, 
and if there is an interval « — ^ ^ ^ ^ 0, through- 

out which f{x) is continuous, with a modulus of coyitmuity w(d) 
such that linitf^ow(d) logd — 0, then the Fourier series for 
f(x) converges uniformly to the value f(:x) for ^ ^ x ^ 

2 . Convergence of Fourier series under hypothesis 
of limited variation 

As the next pages will be concerned largely with functions 
of limited variation, the inseition of a proof of the theorem 
about the convergence of the Fourier series for such a function 
is not out of place. 
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Let f{x) be a function of period 2n, having limited 
variation over any finite interval. Since a shift of the 
origin from which x is measured does not change either the 
character of the function or the terms of its Fourier series, 
when sine and cosine terms of like order are taken together, 
it is sufficient to consider convergence for x = 0. 

Since / (a:) is of limited variation, its discontinuities, if any, 
are finite jumps. Let ^{x) be a function of period 2n which 
is linear for 0<a:<27r, and further defined by the con- 
ditions V/(0+) =/(0-f), V'iO— ) = ^{27t—) = /(O— ), 
*/'(0) = H/(0+)+/(0 — )]. If f{x) is continuous for x = 0, 
tp{x) is merely a constant. If f{x) is discontinuous for x = 0, 
tp{x) is equal to the constant H/(0+)+/(0 — )] plus an 
odd function, and its Fourier series consists of this constant 
plus a series of sine terms, all of which vanish at the origin. 
So the partial sum of this Fourier series not merely approaches 
</>(0) (in accordance with Corollary I above) but is always 
exactly equal to ^>{0). As J(x) — *p{x)-\-[f{x)—tp{x)], 
the problem of convergence for f(x) reduces immediately to 
the corresponding problem for the difference f{x) — ^{x), 
which is continuous at the origin, if f{x) is defined for 
X = 0 as equal to the mean of its limiting values. There 
is no loss of generality therefore in assuming at the outset 
that f{x) is continuous at the origin and vanishes there. 
This assumption will be made henceforth. 

By the hypothesis of limited variation, f{x) can be ex- 
pressed for — n <x^n in the form f{x) = g>i (x) — <Pi(x), 
where and y, are bounded and monotone increasing 
throughout the interval, are continuous wherever f{x) is 
continuous, for x = 0 in particular, and vanish for x = 0. 
For the study of the expression 


let 


S„ (0) 


1 f fff) sin (n + t 
2sm\t 


dtj 


ill 



sin (n + ^ 
2 sin ^ < 


dtj 
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let ji^ be the integral similarly formed with in place of 9)1, 
and let jtx and jtt be the corresponding integrals from — tc 
to 0. Then 5'„(0) (l/^)0‘ii— iia+iai— >22)- 

If and are any two numbers belonging to the interval 
(0, tt), and if n is any positive integer, 


p sin(„ + t) (^ 
2 sin ^ ^ 


3/r. 


For the integrand is alternately positive and negative over 
intervals of length the arches of its graph dimin- 

ishing steadily in height from left to right. The integial 
over an interval corresponding to any number of whole arches 
is therefore a sum of diminishing terms of alternate signs, 
and its magnitude does not exceed that of the largest term, 
namely the area of the first arch involved. The integral 
from any Sj to any is made up at worst of such a sum, 
plus the integrals corresponding to parts of two other arches. 
But the integrand, being equal to i4-cos^+ ••• -f-cosn^ 
never exceeds n + \ in absolute value, and the area of any 
one arch, with base can not exceed n. So the 

magnitude of the integral from $1 to can not exceed Btt. 
(It is almost as easy to see that the value is actually less 
than 2nj and a still lower bound could be obtained, but the 
inequality as written is sufficient for the i)urpose in hand; 
the essential thing is that the right-hand member is independent 
of ?i, hj and 71.) 

If 0 <d < h the number d being regarded 

as fixed, 

sin (n + i)/ I 1 

2 sin i ^ i ^ sin ^ d ’ 


For the absolute value of the integrand never exceeds and 
the magnitude of the integral can not exceed the sum of 
the areas of three arches, each of base 71/(71 -f i) and of 
height not greater than 

Let € be any positive quantity, and let d be chosen (by 
virtue of the continuity and vanishing of 9^1 at the origin) 
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SO that 5 Pi (6) <' f/(6nr). The integral defining jn may be 
taken as the sum of the integrals from 0 to d and from d 
to n. The second law of the mean may be applied in each 
of these integrals. For that from 0 to d it gives 



sin (« + < 

2 sin ^ f 



sin (n + ^)t 
2 sin ^ t 


dt, 


where 5 has some value in the interval (0, d); by the second 
paragraph preceding, the absolute value of the integral on 
the right does not exceed 3/r, and consequently that of the 
integral on the left does not exceed .Vf, for any value of n. 
For the rest of y, , , 



sin (« + i) < , , 
2,inK 



sin (w + t 
2 sin f 


dt 


■f 5Pi 



sin + . 

2 sin if 


the new I being in the interval (d, tt); here <px{S) < 
and the absolute value of each integral on the right, 
by the preceding paragraph, is less 'than or equal to 
37ir^/(n-i- i). So 

sin (» 4- i) < , . i 6 ^ 

2 sin i f ! w -f- i ’ 



which is less than ie as soon as n is sufficiently large. This 
means that jn approaches zero as n becomes infinite. By 
similar treatment of j»i 2 , J 21 , and it is recognized that 
SniO) approaches the value 0 ^ /(O). 

For functions having the original degree of generality, 
the result may be stated in the following form: 

Theorem II. If /{q') is a function of period 2rc having 
limited variation over a period, its Fourier series converges 
to the value fig') at every point where figi) is continuous, and 
to the meayi of the values apjwoached from the right and from 
the left at every point where fix) is discontinuous. 

K fi^) is discontinuous at the origin, the details of the 
above calculation apply properly not to fix) itself, but to 



II. LIMITED VARIATION; ARITHMETIC MEANS 


47 


the difference /(a:) — ^{x), the retention of the symbol f{x) 
having been equivalent to a change of notation. Let it be 
supposed now that ./'(.r) is continuous throughout. Then the 
complication just mentioned docs not arise (except for the 
subtraction of a constant to make /(O) = 0, which is of 
no consequence). The functions U) and ry'sW niay be 
taken as the positive and negative variations of /Qr) itself 
(in the sense that (j) is the positive variation from 0 to 
X when x is positive and minus the positive variation from 
X to 0 when .r is iK^gative, being similarly defined;. The 
use of the origin as nqiresentative of an arbitrary point ./„ 
in the convergence jnoof amounts to another change of 
notation, in connection with which in the course 

of the reasoning, takes the place of 91 (./•« -f- — ^1(^*0). 
Under the present hypotheses, and 9.* are everywhere 
continuous, and so uniformly continuous. Hence the choice 
of d, if the proof is written out in terms of the general no- 
tation, can be made independently of xo. For any Xo. further- 
more, + 7i) — (j>i{Xii) and u>i(xo-\-x) — 9^*(/*o) can not 
exceed the total variation of f{x) over a period, a quantity 
likewise independent of xo. These are the essential points 
needed to Justify the supplementary assertion: 

ConoLLAUY I. If j ix) saf/sjies the hypotheses of Theorem II, 
and is furthermore continuous ererywhere, the Fourier scries 
conceryes to fix) uniformly for all ralues of x. 

If it is assumed merely that /(a) is of limited variation for 
Xq — fj X < Xo-\-f/, and summable over a period, the 
function 9>(.^) which is equal to /(u) for xo^^i <x<Xo + r^, 
and identically zero over the rest of a period, is of limited 
variation over the entire period, and its Fourier series converges 
at to the value i [9'Cro+) + yC/o— )] -- i [/(^-u+)+/(.^o— )]. 
from which it follows that the Fourier series for f{x) does 
the same. If f{x) is continuous and of limited variation for 
« — V ^ ^ 7 iiiid summable over a period, the 

function (f{x) of period 2n which is equal to f{x) for 

— V and linear for <x<a — // + 27r, 

is of limited variation and continuous over the entire period, 
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and its Fourier series converges uniformly with y (a:), for its 
sum; the Fourier series for f(x) — fp{x) converges uniformly 
to zero for a <x < JS, by Lemma II; and consequently the 
series for f{x) converges uniformly to f(x) for 
These conclusions may be expressed as 

Corollary II. If fix) is a function of period 2/1, summahle 
over a period, and of limited variation fco' Xq — ^ ^ ^ 
its Fourier series converges fm* x = Xo to the value 
\{f{xQ-\-)-\rfixQ — )]; if f{x) is of period 2n, summahle 
over a pefi'iod, and continuous and of limited variation foi' 
a — ri X < its Fourier' series converges uniformly to 

the value f (x) for cc ^x < fi. 

It may be noted in passing that the second law of the 
mean leads to a proof of Lemma II for functions that are 
bounded and integrable in the sense of Riemann, without 
the use of Lebesgue integration. In the expression which 
was denoted by Jnix), in the proof as given previously, let 
the integrals from — n to — rj and from // to n be considered 
separately ; the integrand is identically zero for — ^ • 

Since coti w is monotone from // to n, and cot^rr — 0, 

J f{x-\-u)cQi~\i%m7ixidu |cot f(x-\-u)siiinudUy 

where S is a number of the interval (//, n). The magnitude 
of the last integral does not exceed the quantity 8^^, which 
is independent of x and approaches zero as n becomes infinite. 
Similarly it may be shown that the integral from — tt to — jy 
approaches zero uniformly, and the conclusion of the lemma 
follows at once. 

3. Degree of convergence of Fourier series under 
hypotheses involving limited variation 

Attention will now be directed once more to questions of 
degree of convergence. The next theorem is a rather simple one: 

Theorem III. If f{x) is a function of period 2n with 
limited variation, the total variation over a period being V, 
then, for n > 0 , 
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cosnxdx 




sill nx dx 



Let 9 Pi(a:) and 5 Ps(a;) be the positive and negative variations 
of f{x), starting from the point — n, so that/(a:) =/(— w) 
+SPi(x) — 9»{x), 9P,( — n) — <Pi{—n) — 0. Since /(w) =/(—«), 
y>i(n) — <jp*(7i) — 0. But SPi (tt) + 5 pg (tt) = F, and therefore 
fiin) — 9 a in) — ^V. By the second law of the mean, 


£ /*71 

^ 5Pi (x) COS nxdx — <iPi (ti) cos nxdx 

~n / 

J *71 IF 

yi (x) cos nxdx I < . 

-71^ ^ ' I — 2n 

r ^i{x)c,o^nx dx \ < -r- , 

71^ ^ ' I - 2w ^ 

J^^/( — n)ii0^nxdx — 0. 


Similarly, 

while 


Fsinw? 

2n~' 


So the first inequality of the theorem is obtained. A mechan- 
ical repetition of the reasoning for the integral with sin nx 
would lead to the expression cosw? — cosmtt in place of 
sinnTT — sinw?, and the new expression has the maximum 
value 2 instead of 1, since cosn/r^'O, But by virtpe of 
the periodicity the integral from — tt to tt is the same as that 
from — TT + [nr/(2 w)] to n-{’{nj{2n)\, and if the variations 
are measured from the left-hand end of the latter inteiwal the 
upper bound VI n is obtained once more, as stated in the 
theorem. 

Suppose now that /(a:) has a first derivative with limited 
variation, and let V be the total variation of /' {x) over a 
period. By integration by parts. 


J* (x) cosnx dx = sinnxdx 

1 P'* 

= — — I /' (x) sin nx dx. 
n o-ir 


4 
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whence it follows further, since Theorein III is applicable to 
the last integ:ral, that 

. 1 I V 

I I /(./•) cos >/./• — I f' { r) smn.r flr\ ' . .. 

ff ^ U J-t:' ' 

Similar reasoninji: applies to the integral with sin w;r in 
place of cos 7ix, Kepetition of the process of integration 
by parts leads to the following: 

Corollary 1. Jf ,t {jc) is a fnnvtiim of pet' iod 2n tvhich 
has a pth denvative icith limited variation, p T'" 0, and ifV 
is the total variation of f^^^^(.r) over a petiod, then, for 



f(.v) cos n.r d.r 


V 



sin n r d.v 


y 

uP ^ 1 • 


The conclusion is really somewhat more general than tin' 
statement would indicate. Suppose for simplicity once more 
that p =- 1. As will be seen on r(‘-examination of the 
proof in the light of well known theorems on Lebesgue inte- 
grals, the essential thing is not that fix) be uniquely de- 
fined at every point, but that f{x) be expressible as tlie in- 
tegral of a function of limited variation: 


y'(/o+ r 




for any value of a. This observation is of some interest, 
since the simplest functions represented by graphs with cor- 
ners satisfy the modified hypothesis, but not the original 
one. The generalization carries over to the applications of 
the Corollary in Theorems IV and Vb below. 

For the sake of another corollary, let f{x) be identically 
zero for — // < x"- x,, -f - , and of limited variation over a 
period. If is represented once more by the formula 
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both f{xQ-\-u) and yVo + n) cot are of limited variation, 
regarded as functions of u, and conseqiuuitly |*Si /•„) do(*s 
not exceed a constant multi])le of 1/w. W'ith regard to 
uniformity of convergence, let /(r) be identically zero foi 
u — the hypothesis of limited variation over 

a period being kept unchanged. When « ; ./'o the 

function /fc, + h) cot J // is identical with y V« -f- ('0» it’ 
Cl (u) is defined as equal to 0 for 0 i /f : • . // , and equal 
to coti?/ for ^ ^ 1^1 - TT. In general, if (pi (n) and 
are any two functions of limited variation, the total variation 
of yi (?e) being Vi and that of yg (?/) being T's, and if Mi 
and M 2 are upper bounds for | y, (//) and I y^ (u) re.spectively, 
the function yhOo is of limited variation, and its total 

variation does 'not exceed J/i 1^ + J/g . In the ])resent 
instance, yi (/O and y^Cn) being replaced by ./Y./o I ?<) and 
(\ (?/) respectively, /(./'o + u) has the same total variation 
over a period and the same bounds as y’(./*). for any value ot 
.7’o, and C\ (?/) is independent of xq. so that the total varia- 
tion of /(ro + u) cot while presumably different for dif- 
ferent values of .a,, has an upper bound independent of 
as long as x^ belongs to the interval (« , /?). Such an upper 
bound can be calculated more specifically as the product 
of F, the total variation of fix), by a quantity depending 
only on //. (The absolute value of fix) can not exceed T" 
anywhere, since every period contains points where /(x) 
vanishes.) It is possible theiuifore to state 

Corollary II. If fix) is a function of period 2/1 irith 
limited variation, the total variation over a. period heiny V. 
and if fix') vanishes identically for u — then 



for a < X ^ where Sn (r) is the partial sum of the Foaritr 
series for fix), and Crj depefids only on /^. 

Each of these corollaries, taken in conjunction Avith results 
obtained earlier, leads at once to a theorem on degree of 
convergence. If ^ 1 iR the hypothesis of Corollary I, it 


4* 



52 


THE THEOBY OF APPROXIMATION 


is known that the Fourier series for fix) converges uniformly 
to the value fix)\ this follows from Theorem IX of Chapter I, 
and more directly from Corollary 1 or Corollary Ila of that 
theorem, since the hypothesis provides fix) with a bounded 
first derivative, and 

!/(«») —fixi) I ^ ^ 1 3-1 — a-i I , 

if A is an upper bound for \f ix) \ , On the other hand, Corol- 
laiy I, interpreted in terms of the Fourier coefficients, 
states that 

1 «„ I < F/(7»nf I I ^ V/innP+^). 

Hence 

|/(a:) — 5„(a:)| = ^ iakCoskx-\-bkSmkx) 

I 1 

< 2 (laAi + l^fci) 

<2 S F 2F r” du 2F 
= k^+inkf-^^— n Jn pnnP' 

The conclusion may be formulated as 
Theorem IV. If fix) is a function of period 2 n which 
has a pill derivative with limited variation, p ^ and 
if V is the total variation of f^^^ (a-) over a period, then, for 
n>0, 

\fix)-Sn (a:)| ^ ^ 

where Sn{x) is the partial sum of the Fourier series for fipr^, 
Qp is a constant dependimj only on p, and Q is an absolute 
constant: more specifically, Qp = 2l{pn), Q -=^ Qi 2ln, 
This result is to be compared with Theorem X and the 
corollaries of Theorem IX in Chapter I. 

Corollary II will be combined with Corollary II of Theorem IX 
in Chapter I, and with the theorem just obtained. In connection 
with the result from Chapter I, it is to be noted that if /(x) 
is a continuous function which is not identically constant, 
and if ©(d) is its modulus of continuity, 

lim inf > 0. 
cT-o ® 
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For if this wore not the case, it would be possible for every 
t 0 to find aibitrarily small positive values of d such that 
«(d);=Cod; for any such d, i/(xa) — /(.7,)1 < t|T2 — if 
•r* —xi is an integi’al multiple of d, since |/f.r, + iVd) — /(a.-,) I 
^ A’^wfd) < « . A’^d, if N is integi'al; but for any choice of 
xi and Xs, it would be possible to find values of d sati.s- 
fying the condition tliat w(d) < * d. and so small that jj — aj 
differs arbitrarily little from an integral multiple of d; and 
then it must still be true by continuity that !/(a;) — /( ji )1 
— avi) which in view of the arbitrariness of t means 
that ./’(.r^) — - 0 for all a-j and aj, contrary to the 

hypothesis that J'(x) is not constant. The conclusion may be 
stated by saying that «(d)/d has a positive lower bound for 
values of d that are sufficiently small; as w(d) itself has 
a positive lower bound over any interval not reaching to the 
origin, it appears further that w(d)/d has a positive - lower 
bound over any finite range for d. 

Suppose now that /(x), having the period 2n, is con- 
tinuous with modulus of continuity w(d) for a — // ^. ' x^/S-j- tj, 
but not constant over the interval, and of limited variation 
(but not necessarily continuous) over the rest of a period. 
Let y(x) be periodic with period 2n, equal to /(a) for 
a — >1 < X < jS-j- If, and linear for jS-j- ^ -^x : « — i; -j- 2 n. 
In the latter interval, the modulus of continuity of fix) is 
a constant multiple of d, which by the preceding I’emarks 
docs not exceed a constant multiple of «(d), say /i'e»(d), over 
the range within which a)(d) is defined, namely for 0<d 

< — « + 2iV. Ifd .satisfies the latter condition, any inter- 

val of length d Is made up at worst of an interval cougi’uent 
(modulo 2 nr) to (^-f 17. « — ^-f-27i), together with parts of 
two intervals congiTient to (« — and so 9 >(x) has 

everywhere a modulus of continuity a)i(d) which for 0<.d 

< fi — « -f- 2^ does not exceed (fr -j- 2) w(d). By the corollary 
cited from Chapter 1, y (a.) differe from the partial sura of its 
Fourier series by not more than Ami(2Tr/n) logn, and so 
by not more than .^.(/ir-f 2) o)(2n/n) log m, if n is large enough 
.so that 2n/n comes within the specified range of values for d. 
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On the other hand, — is of limited variation, and 
identically zero for a — .r < fi 1 ; , so that the recent 
Corollary II is applicable; and the quantity 1/n which enters 
into the conclusion of that corollary does not exceed a constant 
multiple of to(2n/n). So it is possible to state 
Theorem Va. If the function fix), of period 2n, is con- 
tinuous tvith modulus of eonthmity ®(d) for a — >i 
where a»(d)>0 /or d>0, and of limited variation {hut not 
necessarily cotdinuous) over the rest of a period, then 

fix) — <Sn(jO I- CM (2 njn) log n 

for « < X < d, if n is large enough so that (oi2nIn) has 
a meaning, Snix) being the partial sum of the Fourier series 
.for fix), and e a constant deitending neither on x nor on n. 
The combination of Corollary II with Theorem IV (or 
rather with a part of that theorem) proceeds with a little 
more facility. Let fix) be a function of period 2 n having a 
first derivative with limited variation fora — + 
while fix) itself is of limited variation over a period. 
Let fix) this time be of period 2n, equal to fix) for 
« — 11 , and defined for/84-^<x^ a — t/ + 2;r 

as a polynomial of the thii-d degree so that fix) and its 
derivative have determinate values at both ends of the interval. 
Then f (x) has a first derivative everywhere, which is of limited 
variation over a period, and/(x) — fix) is of limited variation 
over a period and identically zero for a — iy < 

It remains to apply Theorem IV to fix), and Corollary II 
to fix) — fix), and the following theorem is obtained: 

Theorem Vb. If the function fix), of period 27r, is of 
limited variation over a period, and has a first derivative of 
limited variation for a — ^ < x ^ + then 

|/(a:)— 5„(a:)| < 

for cc < X < jSy ivhere /SnU) is the partial sum of the Fourier 
series for fix), and c is a constant depending neither on x 
nor on w. 
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Space will not be taken here for the working out of more 
elaborate combinations of similar character. A theorem re- 
lated to Theorem III, however, will be obtained for the sake 
of a subsequent application. 

It was shown early in the chapter that the Fourier co- 
efficients of an arbitrary summable function of period 2n 
approach zero as a limit. Let /(x) now be a function of 
period 2rr which is absolutely continuous, and let a*, bk be 
its Fourier coefficients. Its derivative exists almost every- 
where, and is summable. The product /(.r) sin lex is like- 
wise absolutely continuous, and so has a derivative almost 
everywhere, and its change of value over an interval is 
equal to the integral of its derivative. Specifically, it has 
a derivative equal to f' (x) sin Jcx-\-kf(x) cos it a; at every 
point where /' (a:) exists. Consequently 

0 -- f{n) sin kn — /( — n) sin {—kn) 

/^TT nn 

^ I /' (j) kx dx-\-k \ f (a;) cos kx dx, 

1 C 

kak= I f'(x)sinkxdx. 

7t TT 

But by the theorem cited at the beginning of the paragraph, 
the last integral approaches zero as k becomes infinite, since 
f'(.x) is summable. Similarly, kbk approaches zero. 

It was seen at the beginning of the chapter that 

^ J [/(.t) — Sn (x)]* dx 

+#.«+«]• 

But under the hypothesis of the moment f(x), being abso- 
lutely continuous, is a fortiori of limited variation, and hence 
(S'nCa) converges uniformly to f{x). So the. left-hand member 
approaches zero, and the right-hand member must do the 

‘ J' L/'wr d.T-^f+ . 


same : 
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(It is still not necessary for the purpose in hand to establish 
the well-known fact that the last relation holds under much 
more general hypotheses. Use of the theorem on the con- 
vergence of the Fourier series for a function of limited variation 
can be avoided by reference to the least-square property of 
the sums iS'»(a;), discussed in the next chapter, together with 
the fact that by Weierstrass’s theorem trigonometric sums 
Tnix) can be found so as to make f{x) — TnCa;) approach 
zero uniformly.) From the last two equations, taken together, 
it appears that 

\Vmx)-Sn{x)Ydx:^ Z (a2-fhk). 

n tf- 7 C fc = B-f.l 


Let e be an arbitrary positive quantity. It is possible to 
choose w„ so that | A- a* | ^ lAhfcl < for A > m#- 

Then, if « ^ no, 


Z (ai-i-bk) < « 
which means that 


z 

Ac=»-fl 


A:- " Jn i( 


(In 


limn Z (afc+6fc) = 0. 

n— 00 fc=.n-rl 


€ 



The results of the last two paragraphs may be summarized in 
Theorem VI. If f{x) is an ahsohitely continuous function 
of period 27 t, and if On, bn are its Fourier coefficients, 


lim nan = 0, lim nbn = 0 . 

n -- 00 n = oo 


If Sn{x) is the partial sum of the Fourier series far /(a;), 
and if Yn is defined hy the ecpmtion 


then 


Yn = J ^ [fix) — 5 n(a;)]* drr, 
lim nrn = 0. 

n = oo 


The first part of this theorem supplements Theorem III : 
the products nan, n bn are bounded for any function of limited 
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variation, and approach zero if the hypothesis of limited 
variation is replaced by the more stringent requirement of 
absolute continuity. The last part of the theorem will be 
used in the next chapter. 


4. Convergence of the first arithmetic mean 

The next undertaking will be to outline a theory of the 
convergence and degree of convergence of the approximating 
functions with which the name of Fej»‘r is associated, the first 
arithmetic means of the Fourier series for a given function. 
The mean in question is defined by the identity 

(*) ^ -V ['^0 (^) + -STx (a:) + . . . + (a-)] , 

where SkU'), as usual, denotes the partial sum of the Fourier 
series through terms of the ^rth order. 

It will be recalled that the fundamental integral expression 
for Sk(oc) involves a factor sin(fc-|-i)(< — x), and otherwise 
does not change with k. The product of the expression 

sin Y t; + sin Y i! + 1- sin |w — yj v 

by 2 sin J v can be rearranged in the form 


[I — cos w] -f- Icos V — cos 2 . -f [cos (w — 1 ) u — cos n v] 

= 1 — cos ni; -- 2sin*(ni;/2). 

Application of this identity, with v replaced by < — x, gives 

or in tenns of a new variable u = — a;): 




sin*nM 

sin*M 


du. 


If (sin*nM)/(sin*tt) is denoted for brevity by <!>»(«), the ex- 
pression becomes 


^ 4- 2«) ®„ (u) du. 
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The fact that the trigonometric factor Oa{n) in the inte- 
grand, unlike the corresponding factor in the expression for 
is never negative, has important consequences, and 
materially simplifies the reasoning. If f{pr) in particular is 
identically 1, each reduces to the single term 1, and 

(r,t(T) also is identically 1, for all values of w: 


1 


1 sinH M(/ — r) 
n n 2 sin* A (f — a) ^ 



(?0 (lf(. 


More generallj% then, if f{x) has J/ as an upper bound for 
its absolute value. 

\^u{oo), -\~2 h)' ( iKi (h) (In 

)l 71 V -7r,2 

-- - I ‘ M On (it) if >1 = M, 

- nn J-jr ,2 ' ’ 

(/ 1 /( 3 ^) i S I'ulues of j\ then j^nCOi 1; for all 

values of x liliewise, and for all values of n. 

For any specified value of x, the identity in the precedinjr 
paragraph, multiplied by the quantity /(./•) , which is constant 
as far as the variable of integration is concerned, states that 

whence 

fin (a) —fU) = — ^ [f{x 2 n) — /(.x)] ©„ («) da . 


Suppose now that /(r) is summable over a period, and 
continuous for x - xo. Let e be an arbitrary positive 
quantity, and let 6 nl2 be chosen so that |/(xo4-2m)— /( xo): 
•> for [u\ -f (5. Then 




1 f*'/* 1 

— I nfO„{a) da 

n n J-JT 2 2 ' 


1 

nn J-ff 2 


(DnWdu 



for all values of n. On the other hand, since l/(sin*d) 

for d < luj n/2, 
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^ [Lnl -2 +.C ) — ./(^o)] ®H (h) dn j 

^ ^sin*d J-.« -Ax,)\dn-, 

the last integral exists, by the hypothesis of summability, 
and is independent of n, so that the whole expression is less 
than ie as soon as n is sufficiently large. Therefore 
<i»>(a;o) — f{?co)\<t, when n is sufficiently large; ff„(jro) 
converges toward /(a^o). If /(a:) is continuous everywhere, 
tf„(r) converges toward f{x) for all values of x, and the 
convergence is uniform. P^r d in the proof can be chosen 
independently of ao, since fix) is uniformly continuous, and 
if M is the maximum of \fix)\, 

J ’nii 

|/(ao4-2u)-/(a-o) du^2Mn, 

~7r/2 

which is likewise independent of a-o. 

For convergence at a point of discontinuity, let fix) be 
summable over a period, and have a finite jump for x = xo. 
Let fix„) be defined as the mean of the limits approached 
from the right and from the left. Let a function »/’(.?•) be 
defined by the requirements that it shall be of period 27 r. 
and linear for a-o<a''--.ro + 2/r, and that ff>ixo-{-) - /(•^•o+), 
•/'(a’o— ) =/(a'o— ), '/'(J’o) = fM. For x -= xo, each 
partial sum of the h'ourier series for tpix) is exactly equal 
to ^ixo), as was pointed out in substance at the beginning 
of the proof of Theorem II. The same is true therefore of 
the arithmetic means of these partial sums. But fix) is the 
sum of tpix) and a function continuous for = and the 
arithmetic mean formed for the sum of two functions is the 
sum of the corresponding arithmetic means. So the arith- 
metic mean ix) formed for fix) converges for x — Xo to 
the value fioco). 

The results on convergence obtained thus far may be 
restated in 

Theorem VII. If fix) is a summable fundioti of period 2n. 
and (aj) the arithmetic mean of the first n partial sums of 
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its Fourier series, as defined above, <fn (x*) converges Unrnrd 
fix) at every point where fir) is continuous, and converges 
toward the average of the limits approached from the right and 
from the left at every point where fix) has a finite jump, 
ff fir) is continuous everywhere, the convergence is uniform 
throughout. 

It should be mentioned as an immediate consequence of the 
definition of the arithmetic mean that ifSnix) converges at any 
point, (Tnix) converges to the same value. In fact, if -ScaV] ,&,••• 
is any convergent sequence of numbers whatever, with limit 
and if a*,, = OSo + S'i+ • •• +Sn~i)/n, then a,, also converges 
to the limit S, Let aV — aS/^-= Jt/c, nnd let N be chosen so 
that j Eu for k N, Then, if n'.' X, 

V _ 4“ • * * ~ f~ l^n-\) 


Eo + E^ + . > +AV i 

n 




_j_ iL ■ ' ’ .^7 ^ 


the last fraction is less than ie in absolute value for all 
values of n > X, and the preceding fraction, in which the 
numerator is independent of 7i, is less than for n suffici- 
ently large. 


5. Degree of convergence of the first arithmetic mean 
As a first hypothesis for the study of degree of convergence, 
let fix) satisfy everywhere the condition 


l/M —/(Xi ) I < ^ i a:* — j:, ! . 


In the integral expression for <fn{x) — f(x), 
fix + 2u) —fix) I ^ A I 2 n I , 


so that 

‘2 A 4 A 

Wnix)-f(x)\ f ^ ^Jo 
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In dealing with tlie last expression, it is to be remembered 
that «/sin/t ^ 7r/2 for 0 <m < 7r/2. Let the integrals from 
0 to 1/n and from 1/n to nl'i be considered separately. 
In the former, let the integrand be expressed by the formula 




n 

sin?* 


■ sm nu ■ 


siiiJi?* 
sill ?t ’ 


of the three factors indicated, all of which are positive for 
0<u^ 1/m, the first does not exceed nl2, the second does 
not exceed 1, and the third does not exceed n. Hence 



It 

o' • 


In the integral from 1/m to it!2. 


sm* u 


and 


/■•-''/S TT* C^'du 

I , 7 - = 

" 4 Ji/« K 


1 1 
/( = 4 ■ V’ 

‘I +10gM|. 


It follows that the whole inte<?ral from 0 to /t/ 2 does not 
exceed a constant multiple of log?«, and \ — /C/’)| does 

not exceed a constant multiple of (logw)///. A slightly 
more specific statement of the result is 

Theokf:m VIII. if f{^) a function of period 27i satis- 
fying everywhere the condition 


^ J (^’2) J (‘^*1 ) I ^-1 “^2 ^'1 1 » 


and W the corresponding arithmetic mean^ then, for 




Co l log n 


wheie Cq is an absolute constant. 

Let fix) have a derivative satisfying everywhere the 
condition 


I /' (r* ) — /' (.r, ) I ^ ^ I .Tj — .?•, |«, 
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with a positive value of «. The error then can be somewhat 
more narrowly restricted. The factor On(u) being an even 
function of u, the substitution of — it for it gives the integral 
expression for the error the alternative form 

ffn W —/(.t) = — I [fix. — 2«) —J (r)] <D„ (ft) du 

J71!2 

== I (/(a- — 2it) —/(»)] <!>»(«) dit, 

tin 

and a third expression is obtained by taking the average of 
this and the original one: 

<^n(x)—fi-r )=\ — I [/(J’+2«1— 2/(.r)-t-/(a— 27<)]d>»(ft)<l«. 

innj „i2 

By the mean value theorem, together with the hypothesis 
now in force, 

J(x + 2 h) —fix) 2 m/' (?,), 
fix — 2u)—fix) == — 2 71 /'(?*) , 
fix + 2 ft) — 2 fix) -\-fix — 2 It) = 2 u [/' (?,) —■/' (?*)] , 
l/(a: + 27t)-2/(x)+/(rr-27t)| < 2 A i it | • | 

< 2’*“+* A I 74 "Tl, 

the numbers and J* being in the intervals ix,x + 2 it) and 
ix — 2 It, a:) respectively, so that | < 4l7tj. Hence 


But 


92« 2 

ix) -fix)\< — - - It l“+‘ (m) du 

nn J-7c/-> 

92«-f 1 2 . 

— I <2>n (w) d\l . 

nn Jo 


<Pn(M) ^ 




sin* It “ 4 » 

in the interval of integration, so that 

220-: 


linix)—fix) \ < 


' li-n du 

h' Jo 
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which does not exceed a constant multiple of 1/n, the last 
integral being convergent. In more formal statement: 

Theorem IX. If fir) is a functioyi of imiod 2n haring 
a first derivative which satisfies everywhere the condition 

\f ifCs) —y'Ui) I 'I 

with «>0, and ifan(x)is the rorrespondiny arithmetic mean, 
then 

:/W-.,(r) '“A. 

where C« ts a constant depending only on u . 

No higher order of apiiroximatioii would be obtained by 
supposing f{:r) provided with additional derivatives. In fact, 
the arithmetic mean corresponding to tlie analytic function 
f(j') cos a; is (.r) - [(n — l)/?i]cos.r, and the error 
is actually of the order of 1/w. 

Let /(./•) be an arbitrary continuous function of period 2 n, 
with modulus of continuity o) (d ) . The theorem to be obtained 
in this connection is perhaps of secondary interest, because 
a closer result is given for an important class of cases by 
the theorem following it ; but it also covers cases not admitted 
by the hypothesis of the later theorem, and so is not entirely 
superfluous. The proof is an adaptation of that of Theorem II 
in Chapter I. Let ^ {x) be a continuous function of period 2 n 
which is equal to f{x) for a set of values of x dividing 
a period into yi equal parts, and is linear between successive 
points of this set. This y (.r) satisfies the hypothesis of Theo- 
rem VIII, with A -> [o)(2n/)i)]/{2n/yi), and is represented 
by the corresponding arithmetic mean with an error not 
exceeding a constant multiple of <a{2 nhi) log n. The absolute 
value of the mean corresponding to the difference /U)--y (o’) 
does not exceed the maximum of the absolute value of the 
difference itself, which is not greater than 2(a{2nhi), and the 
error of this mean can not be greater than 4w(27r/M)‘ Hence: 

Theorem X. If f{x) is a conthiuous fnyiciion of period 2 n, 
with ynodidus of conthuiity a)(d'), and (JnQr) the cor respond mg 
arithmetic meayi, then, for yi'^l, 
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'/(.r) — (.r) I < To I log V . 

irJicre Co is an absoliife constant. 

Let /(.r) be subjected to the hypothesis that 

!/(3’3)— ./Vi)| ^ a*,|" 


with 0 a < 1 . From the original integral expression for 
the error, 


/wi ^ 


X pr /2 

J -7cr2 

j: 


nn 
2"+^ A 


n 71 


2 H I" <Z>rt (li) d II 

• o 

!(“ On {><) (111. 


Let the interval of integration again be considered in two 
parts, from 0 to l/w and from 1/m to n/2. In the former 
interval. 


n 

«“©«(«) = — z — sin"«?<- 
sin^M 


tsinnuY'" , 

\ sin M. / \ 2 / ’ 


1 • 


so that 


From l/« to 



iC On (?<) du 



(( 


7x12, 


;C-< 



On Oo du 


4 Ji/H " 4 « 4(1 — a)' 


Substitution of these inequalities in the formula for the error 
gives 

Theorem XT. Tf f(.r) is a funvtioyi of yvriod 2 71 satisfying 
everywhere the condition 

\f(^'‘2) —fM\ ^ — or,^ I", 

with 0 <! « 'C 1 , and if (Sn {x) iif the corresponding arithmetic 
mean, then 

l/W — tf7,(ar)| ^ , 

ft 

where Cu is a constant depending only on a. 
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Two outstanding facts with regard to the arithmetic mean, 
as compared with the simple partial sum of the Fourier series, 
are that the former converges for every continuous function, 
but does not reproduce identically a function which is itself 
a finite trigonometric sum. Consideration of Theorems XI, 
VIII, and IX, in the order named, throws some further light 
on the ability of the arithmetic mean to adapt itself to irre- 
gularities in the function represented, and its inability to avail 
itself of an exceptional degree of regularity. The hypotheses 
of the three theorems imply that 

\f{cr + v) — 2f(.r) +f(r — v) 1 < ! r |“ 


with a constant in each case, and values of u successively 
less than 1, equal to 1, and greater than 1 (the present a, 
in the case of Theorem IX, taking the place of the number 
previously denoted by « + !)• The upper bounds obtained 
for the error of the arithmetic mean have the orders respect- 
ively of l/«", (log n)/w", and The corresponding 

upper bound for the error of the simple partial sum of the 
Fourier series is of the order of (log>i)/w“ in each case. 

For a concluding theorem with regard to the arithmetic 
mean, let f(jr) be of period 2nj summable over a period, and 
identically zero for a — i/. Jt x has any value 

in («, /^), f{.r-\-2u) is identically zero for — i'y > 

Hence 


Since <Z>n(/<) l/(sin*^i!/) for < \n \ < in, it follows that 


-^/2 

:/2 


On (jo) 1 ( r f 1/U- + 2 1<) I du 

nresin* \J-nl2 Jvh' 

1 


»t/2\ 


pjr/2 


< r-j-1 — i \f{x-^2ny dll. 

--- nn Sin* \i) J -nht''' 


In terms of the variable t = x-\-2u, 
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the limits of integration on the right having been adjusted 
by use of the frequently applied observation that the integral 
of a periodic function over a period is independent of the 
position of the initial point of the period. If the last integral, 
which is independent of x, is denoted by J, 




J 


2 w 71 sin* i tj 

The conclusion is 

Theorem XU. If f{x) in a fumiim of period 2 n which 
in idmiicnUy zero for a — and summahle ocer 

II period, the integral of its absolute value ocer a period being J, 
atul if is the corresponding arithmetic mean, tlmi, for 

it ^ , 1 ' ^ , 


where is a constant depending only on ij. 

Taken with the last assertion in Theorem VII, this shows 
that if /(a) is continuous for « — ij y ~ x /S-f i\, and summahle 
over a period, the arithmetic mean associated with it con- 
verges uniformly for a x ^ The results obtained by 
combining Theorem XII with Theorems VIll-Xl need not be 
enumerated at length. 

6. Convergence of Legendre series under hypothesis 
of continuity over a part of the interval 

The theory set forth in the early part of the chapter can 
be carried over in some measure to the case of Legendre 
series, the discussion being kept on the same elementary level 
which was maintained in the treatment of these series in 
Chapter I, to the extent that no use is made of an asymptotic 
formula for the Legendre polynomials. The next paragraphs 
will be devoted to a presentation of the analogies that work 
out most readily, though it will be seen that the treatment 
is left incomplete in several particulars. 

Let f{x) be summable together with its squjire for 
— 1 and let au be the coefficient of Xk{x) in its 

Legendre series, 
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Let 


>S» (./•) -- a; C- ) f «. A', (./•) + • . . + <U x„ (r) , 


as beforo. By reforence to the property of orthojyonality of 
th(^ polynomials XuM, together with the definition of (ik and 
the fact that the integral of [A^C/j)]^ over the interval ( — 1, 1) 
is 2/(2/» -h 1), it is seen that 




As th(' (piantity on the left-hand side can not be negative, 
the sum on the right is bounded for all values of u, the 
infinite series obtained by letting n beeonn* infinite* is eon- 
vergent, and 

lim 0. 

A -oc 

(ir. as e,(iuivalent statonients. 


lim «fe//r* - 

k 00 


0, lim A-’ - f /■(/) XiaOiH — 0. 

3C J I’ 


The parallelism with the case of Fourier series becomes 
clearer if the polynomials Xki r) are replaced by the normalized 
sequence {[(2/t- + 1)/2]^'- or, more superficially, if it 
is considered that the magnitude of the coefficient is less 
significant than the magnitude of the general term of the 
series: it was seen in Chapter! that |A"a(./')| does not exceed 
a constant divided by when .r stays away from the ends 
of the interval, and consecjuently 

lim HkXki.r) ' 0 

A- = JO 

unifurmhj tlmmgliont any closed inferral interior to{ — 1, 1). 

Let /(./*) again be summable together with its square, and 
suppose now that it vanishes identically for ao — c ‘ - ./‘o + . 
the interval (j-q — ro+J^/) being contained in ( — 1,1). The 
value of Sn{j\)) can be expressed in the form 
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---^ A'nfe) r .s:n+x{t)dt. 

^ J-1 .I'o — f 

p]ach of the quantities Xn-^ii^o)^ AnUo) is less than a con- 
stant divided by n}^-, and the factor in front of each integral 
is therefore less than a constant multiple of the quan- 
tity ll(xo—t) is bounded over the r.ange where f{t) is 
different from zero, and fit)/ixo-- t) therefore is summable 
together with its square; consequently, by the preceding 
paragraph, the product of either integral by a quantity of 
the order approaches zero. In other w^ords, Sn(xo) con- 
verges to the value 0. If two functions f{x), y(rr) are identical 
for Xo — fj ^x <xo-\- ri, the Legendre series for their differ- 
ence converges toward zero at the point xq. and the series 
for fix) and (fix) themselves converge or diverge together 
at that point: the convergence of the Legendre series for a given 
function at an interior point of the interval ( — 1, 1) depends 
only on the behavior of the function in the neighborhood of 
the point. 

As a preliminary to a discussion of uniform convergence, 
let fix) be any summable function with summable square 
over the interval ( — 1, 1), and let 

Akiy) ■--- fj{t)XK{t)dt. 

It is to be shown that Akiy) approaches zero uniformly 
for — 1 ^ y S 1 • The proof is largely a repetition of the 
corresponding argument for the case of Fourier series, but 
there are differences of detail which are not altogether trivial. 
For any fixed value of y, Ak(y) is the same as 

fjx{t)Xkmt, 

if /i(<) is a function equal to /(f) for — 1 f .4y, and 
v anishing for y< < < 1 , and consequently Ak{y) approaches 
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zero for the particular value of y in question. Let e be any 
positive quantity. Let 

This function is continuous, and therefore imiformly continuous, 
for — 1 S 2/ ^ 1 • Let 6 be chosen so that 

whenever [(/* — y\ Ih being points of the inter- 
val ( — 1, 1). Let .tVbe the smallest integer such that 2/jtV< J, 
and let sj — -f y = 0, 1 , • • •, .V. The points i; 

being finite in number, there is a k' such that k^<-\Ak{zj) 
for all the values of j in question, whenever k 2: k’. .For 
any value of y in (—1, 1), let ^ be that one of the uumbei's 
0, • . . . iV for which 0 < y — Zj<. d, and let Z = Zj. 
(’onsider the difference 

Akiy) - AkiZ) =,0/(0 Afc(0 dt. 

By Schwarz’s inequality, 

[0/(0 Mt) rf jjl [/(oi* '(f0 [Afc(0]* dt 
^.[A{y)-Am 0 [Afc(O]^/< 

<(|.)V2/(2A-fl)-.:(|^r/.-. 

Consequently 

k^-\Ak{y)-Aki.Z)\^ 2 *• 

ITiis is true for any positive value of k. On the other hand. 
if k '' k', k^i^l Ak{Z) \ < . For such values of k, therefore, 

Afc(y)| <e; and k' is independent of y. Let <k be the 
maximum of | Ak(y) | for — I ^ ^ 1 • The conclusion is that 

limk-=ooA*'‘^«k = 0. 
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Now let f(x) be suramable together with its square over the 
interval ( — 1, 1), and identically zero for « — ^ ^ a: < + 

where — !<« — Consider the expression 


m Xnit) 

X — / 


on the assumption that x has a value belonging to the inter- 
val a ^ rc < ^; the function An{t) has almost everywhere 
a derivative equal to f{t) Xn{i)> Let R{u) be a function 
equal to 1/u for and so defined for \u\<fj as to 

have a continuous derivative everywhere. Then, for any x 
in {ccjff), R(x — t) is the same as Mix — /) at all points of 
the interval — 1 ^ ^ ^ 1 where f{t) is different from zero, 
and the integral above is equal to 

R{x — t) dt . 

The product An{t) R{x—t) has almost everywhere a derivative 
with regard to L equal to An(t) B{x — 0 — An{t) R' {x — t), 
and its increment over an interval is equal to the integral 
of this derivative, so that 

J ^ An{f) R{x — t) dt 

= [A „ «) E (x - OIL, +£^An{t) E'ix — t)d1. 

Both B{u) and R'iu) are continuous for all real values of u, 
and approach zero as u becomes infinite in either direction. 
Let Ma be the maximum of |i2(M)|, and Mi the maximum of 
|i2'(M)|; these numbers are of course independent of x. In 
the right-hand member of the last equality, (— 1 ) 0 , 

14n(l)| ^ fn, — 1)1 < dfo, while |A„C<)I ^ *n, 

\R'{x — 0| ^ Ml, over the whole interval of integration. 
Consequently 

I A!n (t) R{x — t) dt < ilfo «n + 2 Ml fn . 

Since lim„.r.oQ «« — 0, 
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approaches zero uniformly for « ^ x ^ The same con- 
clusion holds if Xn{t) is replaced by Zn-fi(0- On the other 
hand, Xn+i (x) and Xn {x) are uniformly bounded over 
(a, /5). Hence, on assembling the constituent parts of the 
integral expression for Sn (x), it is seen that Sn (x) approaches 
zero uniformly over the interval in question: 

Theorem XIIL If f{x) is summahle together ivith its square 
over the interval ( — 1,1), and identically zero ove?' an interval 
« — + ^ contained in ( — 1 , 1 ), and if Snix) is 

the partial sum of the Legendre series for /(x), then Sn{x) 
converges uniformly toward 0 for cc ^ x ^ 

With the corollary of Theorem XI in Chapter I, this yields 
at once the further 

Corollary. If f{x) is summahle together with its square ovei' 
the interval ( — 1, 1), and continuous over the interval a — 
< X < /S + , ^vith a modulus of continuity w (d) such that 
limtf=::o a)(d) log d = 0, the Legendre series for f{x) converges 
uniformly to the value f{x) for u < x ^ /i. 


j. Degree of convergence of Legendre series under 
hypotheses involving limited variation 

It remains to consider questions of degree of convergence. 
The discussion will be based on the properties of the Legendre 
polynomials already used, together with the identity 

Xn(x) = 2^^[X4+i(a:)~A’;.-i(3)]. 


Let f{x) be a function of limited variation for — 1 < .r < 1, 
its total variation being F. Let 9 'i(x) and ^tix) denote its 
positive and negative variations, measured from the left-hand 
end of the interval, so that f{x)=f{ — l) + 9 >i(a;) — <fi{x), 
(Pi{ — 1) — y)a{ — 1) = 0. By the second law of the mean, 


y, (a;) A» (x) dx ^ <fi (1) J] Xn (x) dx 



72 


THE THEORY OF APPROXIMATION 


for some value of $ in ( — 1 , 1 ). But as a consequence of 
the identity in the preceding paragraph, with the observation 
that X„+,(l) -= = 1 , 

JJ* X. (./•) dx = [X„+i {x} - X„-i (a-)]‘ 

For a sufficiently close estimate of the magnitude of the 
difference in the last bracket, there is occasion to go back 
to the identity 

Xn (x) = -— [^ [r + / (1 — cos d (p . 
n e/o 

If this formula isused to express the difference Xn-\{x)—Xn^iix) 
by means of a single integral, the integrand is 


[x + i{l — cos { 1 ~ [x i (I — cos 
= [:r + /( 1 cosc/)]'^“^f 1 — 1 —xrY^'i 1 -f- COS"^?)— 2 COS9’] . 

For X in (—1,1), the absolute value of the last bracket 
can not exceed 4 , and hence 

|X„_i(.r)-X.+iWi < r \x-]-i{l-x^yi^cos^^-^d<f. 

71 t/0 

On repeating, with slight changes of notation, the reasoning 
which led to inequalities for the Legendre polynomials in 
Chapter I, it is found that 

:r* -)- ( 1 — x^) COS* y e 


for the values of the variables that come into consideration, 
if 2' = (2/7r)(l — and 


J ^7l/2 
0 ' 




91/2 


{n — iyi'^z 


(In 


J '^QC 

j *00 

-u’ 

0 


dn. 
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With the observation that n — l^\n for n 2, the pre- 
ceding inequalities may be summarized by saying that there 
is an absolute constant c such that 


Xn-x{x)-Xn-^l{x)\ < c/nl/2 


for n > 1 and for — 1 < a; < 1 ; it is clear that the case 
71 — 1 can be included in the final statement. (Strictly 
speaking, of course, the result is obtained directly for la:| < 1 , 
and extended to the ends of the interval by continuity.) 
The essential point is that there is no factor (1 — in 
the denominator on the right, as there was in the corres- 
ponding inequality for a single Legendre polynomial; its 
absence is due to the factor (1 — which comes out before 
the integral sign in the expression for the difference. 

Applied to the problem in hand, the inequality just obtained 
shows that 


iX' 




yi(l) 
2n -f i 




Similarly, the magnitude of the integral formed with fi{x) 
in place of yiCr) does not exceed cy* (l)/(2n^'^), while for 
n > 1 the integral corresponding to the constant /( — I) is 
zero. Consequently, as yi(l) + ys(l) — T’, 

X ' 1 (• F 

The analogue of Theorem III is 

Theorem XIV. If f(x) is of limited variation for — 1 ^ a: 
< 1 , its total variation being V, and if an is the coefficient 
of Xn{x) in its Legendre series^ theUy for 


where ifo is an absolute constant, so that 


an An(x)| < 


71 


for — 1 + ^ 1 — V} constant depending ordy 

on tj . 
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Now let it be assumed that f{x) has a first derivative with 
limited variation, and let the total variation of fix) for 
— 1 ^ < 1 be denoted by V. (In more general terms, it 

would be sufficient that f{x) be the integral of a function 
of limited variation, and not necessarily provided with a unique 
derivative at every point.) For the moment, let 

Ynix) = Xn(t) dt. 

By integration by parts, 

Xn{x) dx = Yn{x) dx 

- [f{x) dx. 


In the last member, Yn{ — 1) = 0, and for n ^ 1 , FnCl) “ 0 
also. Furthermore, by the identity already employed. 


¥n(x) = 


2 fi^ \ 1 (:^’)] ; 


the terms coming from the lower limit of integration cancel 
each other, since Zn+i ( — 1) = Xt-i( — 1) = ( — ^ . Hence, 

with the assumption that n 1 , 


^_^f{x)Xn{x)dx = [Xn-i (x) — An+iC^:^)] dx. 


If the integral on the right is taken as the difference of two 
integrals. Theorem XIV, or, more directly, the inequality 
immediately preceding the statement of that theorem, can be 
applied to each. It is to be noted once more that l/{n — 1) 
^ 2/n for n > 1 . The resulting general formulation, however, 
does not hold forn = 1, as may be seen by taking /(:r) = Oxy 
in which case F — 0, while O may be arbitrarily large; the 
conclusion is essentially restricted to values of ^ 2. In- 
duction based on repetition of the process of integration by 
parts leads to a conclusion which may be stated as 
Corollary I. If f{x) has apth derivative of limited variation, 
^ > 0, and if V is the total variation of f^^^ (.r) for — 1 ^ ^ 1 , 

the^iy for n >Pj 
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lOnl ^ 


RpV 

nP+m) ’ 


where Rp depends only on p, and 


-Xn (3?) I 


1 


for — ^ 'f ri ^ 1 — 11 , where Rp,^ depends on p and on >/ , 
but not on X or on n. 

Suppose fix) is of limited variation for — 1 < x < 1 , with 
total variation V, and identically zero for « — y 
Let R{ii), as before, be equal to Hu tor \ u\>fj, and defined 
in some way for | m | <C ^ so as to have a continuous derivative 
everywhere. If ifi is the maximum of |iJ'(u)|, the total 
variation of E(u) over any interval of length 2 does not 
exceed 2 Mi, The quotient f{t)l(x — t) is identical with 
f{t)R{x — t), if X has any value, belonging to the interval 
^ It is of limited variation, regarded as a function 

of and its total variation has an upper bound which may 
be expressed as the product of L by a quantity depending only 
on 71, and, in particular, independent of :r. So Theorem XIV 
may be applied to the function f{t) K{x—t), and the result 
substituted in the integral expression for Snix), to prove 

Corollary II. If f(x) is of limited variation for — 1 ^ ^ 1 , 
with total variation V, and identically zero throughout an 
interval a — + ^ contained in ( — 1 , 1 ) , and if 

Sn {x) is the partial sum of its Legendre series, then, for 

jS’nCa-)! < — , 

n 

where ( 7 ^ depends only on fj. 

Further consequences can be deduced substantially as in 
the case of Fourier series. If f{x) has a bounded derivative 
for — 1 ^ ^ ^ 1 > it comes under the hypothesis of the 
Corollary of Theorem XI in Chapter I, and its Legendre series 
converges to the value f{x) for — 1< x < 1 . If the series 
converges uniformly for — ^ ^x <1, its sum agrees in value 
with f{x) at the ends of the interval also, by continuity. 
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So Corollaiy I above yields information as to the order of 
magnitude of the error, under the hypotheses indicated. A result 
may be stated for the entire interval ( — 1 , 1), as well as for 
an interior interval, since | «« An (a;) | < j n,, | for — 1 ^ a; ^ 1 : 

Theorem XV. If f(.x) has a pth derivative with limiteil 
variation, p if V is the total variation of (x) /w 
— 1 X < 1 , and if 8n (x) is the partial sum of the Legendre 
series for fix), then, for n> p, 

\fix)-8nix)' 

for — 1 ^ ^ 1> tvhere Qp depends only on p, and 

i/(.r) — .S»(x)| <; 

for — 1 +^7 ^ oc.< 1 — tjf tvhere Qpj^ depends only on p and on 
In conclusion, the following consequences of Corollary II, 
taken first with the corollary of Theorem XI in Chapter I 
and then with the Theorem XV just formulated, may be 
stated without further comment: 

Theorem XVIa. If f{x) is continuous mth modulus of 
continuity w(d) throughout an interval a— + 
contained in ( — 1, 1), tvhere wf(J)>0 for d>0, and of 
limited variatmi over the rest of the inteival ( — 1, 1), then 

\f{T)—Sn w I r«(2/ n) log n 

for a<x<fi,ifn is large enough so that (»(2/w) has a meaning, 
Sn (x) bei7ig the partial sum of the Legendre series for fix), 
and c a constant depending neither on x nor on n. 

Theorem XVIb. If fix) is of limited variation for — 1 
< a- < 1 , and has a first derivative of limited variation 
throfughoat an interval « — contained in 

i — 1, 1) , then 

'f{x) — Sn(.x)\'^ Y 

for ct X ^ j U/here S,i ix) ts the partxal sum o^ the Ijegendt e 
serm for fix), and c is a constant depending neither on x nor 
on n. 



CHAPTER III 


THE PRINCIPLE OF LEAST SQUARES 
AND ITS GENERALIZATIONS 


I. Convergence of trigonometric approximation as 
related to integral of square of error 


In the discussion of Fourier series hitherto, scarcely any 
mention has been made of one of their most remarkable and 
important properties, namely their relation to a problem ot 
least squares. The propert}' in question is not peculiar to 
Fourier series, but is of much wider significance. 

Let po(-r). Pi (a:), ^>s(ir), • . be a sequence of normalized 
orthogonal functions over an interval («, h), that is, a sequence 
of functions satisfying the conditions 


PJ (a^’) pk (a) (lx — 0 0 + it)- 



1 . 


Let fix) be another function defined over the same interval. 
The functions pk ix) and /(x) may be continuous, or, more 
generally, bounded and integrable in the sense of Kiemann, 
or merely summable together with their squares. The problem 
of least squares in question is that of determining a set of 
coefficients co, •••, r„, for a given value of n, so that when 

SPix) CoPo(a:) + c,jh(x)-i !-<‘«PnCr) 

the integral 


regarded as a function of its coefficients, shall be a minimum. 
The following theorem is well known: 

Theorem I. The integral has a minimum value, for the 
attainment of which it is necessary and sufficient that 


Ck —^fU)Pkix)dx, 


77 


/■• = 0 , 1 . • • - , 



78 


THE THEORY OF APPROXIMATION 


It is to be noticed at the outset that no function of the 
form ^{x) can vanish identically over the interval unless all 
the coefficients Ck are zero; in other words, the conditions 
on the p’s insure that they are linearly independent. For 

£[f(x)Ydx + ■■■+<> 

and the identical vanishing of y would require the vanishing 
of the sum of the squares. More definitely, it is recognized 
that the value of the integral on the left must be positive, 
if the c’s are not all zero. If the ^’s are continuous, this 
means that there must be one or more intervals throughout 
which (f remains different from zero; if they are not so 
restricted, it means in any case that (p must be different from 
zero over a set of positive measure. 

To prove the necessity of the condition, suppose that <p{x) 
is a function of the form ^CkPkix), in which, for a particular 
ndex k — m, 

Cm I I fix) Pm (x) dx. 

Let 

Ip (x) = CoPo (a;) + c, /Ji (a-) + h iCm h)p mix) -\ + VnPn ix) 

9ix) + hpmix), 

where is a constant, the value of which will be specified 
presently. Then fix) — xpix)=^ fix) — fix) — hpm ix), and 

L/’(a;)— '^(a;)]*(ia: =J^ [fix) — fix)Ydx 

~ 2/i [fix) — f (a;)] Pm (a:) dx + If [pm ix)Ydx. 

The last integral in this equation is equal to 1 ; if the others 
are denoted by J, R, S respectively, 

E — 2hS + h\ 

But r fix) Pm ix)dx = Cm, and 

^ [fix) — f ix)] Pm ix) dx = ^^fix)pm (a?) dx — Cm 4" 0 , 
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by hypothesis. Let h be taken equal to 8 \ then 
J ^ R~ 8 *<E, 


and the value R corresponding to the function y is not the 
smallest value of the integral, since the function rp gives 
a smaller value. 

To show that the condition is sufficient, suppose now that 
^ ~ ^CkPk with Cfc — /(x) pk{x) dx for it = 0, 1, • • •, w, 

and let 

^l{x) = ci)P(i{x)-\- Cipi{x)-{- • ■■ ^-c'nPnix), 


where at least one clc is different from the corresponding Ck. 
Then fix) — rpix) is a linear combination of po, Pn, with 
coefficients which are not all zero, and 

F — [9’(3') — ^{x)]* dx>0. 

On the other hand, 

nb 

\fix)pkix)dx = Cfc fix)pk(x)dx, 

[/(^) — fix)] Pkix) dx ^ 0, /c = 0, 1, • • •, M, 


so that if x(x) — 2ck pk(x) is any linear combination of 

Poy • • • 1 Pn, ^ 

[fix) — y(a;)] xix) dx 
= 2 Ck ( [fix) — 5p(a;)] Pkix) dx -= 0. 

fe=:0 

In particular, 

t7 = r [fix) — fix)] [ 9 )(a;) — ipix)] dx = 0. 

t/ 0> 

Hence, by the resolution / — tp = if — y) + (9) — «/')> 

[/(a?) — i^ix)y dx — [fix) — yCa:)]* dx-\-2U-]-V 
> [fix) — f (a;)]* dx, 
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which means that y actually gives the integral a smaller 
value than any other linear combination of 

As the most important of specific cases, ih, Pi, • • • may 
be the sequence of functions 

1 cos a; siii j- cqs2^r sin 2 a- 


If u above is re])laced by n = 2 >i + 1, y (j-) is a trigonometric 
sum T„{j') of the 7ith order. A periodic function f{x) being 
given, the condition that the integral 

shall be a minimum, is that the coefficient of cos/.- a:/ be 



cos /. t 

' ;r^ 




for k:>0, with a corresponding determination of the constant 
term and the sine terms, and this is equivalent to saying 
that Tn{x) shall be the partial sum of the Fourier series 
for /(x). 

This property of least squares may be taken as a point of 
departure for a discussion of the convergence of the series, 
under suitable hypotheses wth regard to f{x). While the 
hypotheses are more restrictive than those used in the con- 
vergence proofs ali’eady given, the method is of interest for 
its own sake, and fnrthemore lends itself to a remarkable 
variety of generalizations in other directions. It depends on 
theorems of approximation from Chapters I and II, and also 
on a proposition, due in the first instance to S. Bernstein, 
the simplicity and importance of which are noteworthy in 
the highest degree: 

Bernstein’s Theorem, 1/ Tnix) is a trigonometric sum of 
the nth order, and if L is the maximum of its absolute value, 
the absolute value of the derivative Tn{x) can not exceed nL. 

The proof to be given here was devised independently by 
Marcel Biesz and by de la Vallee Poussin. For the purposes 
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of the demonstration, the theorem may be restated in the 
following equivalent form: 

If Tn{x) is a trigonometric sum of the nth order ^ and if 
the maximum of\Tn{x)\ is 1, the maximum of\Tn{x)\ itself 
can not he less than 1/n. 

Suppose the maximum of \Tn{x)\ were less than 1/n. 
Then, for any value of the constant (7, the function 

I^n(x) = y- cos (no: C) Tn(x) 

would have the same sign as cos(nrr — C) at each of the 
points C/n, iC+7i)/n, {C+2n)/n, {Cln)-\-2n, where 

cos(wa: — C) takes on the values +1 and — -1 alternately. 
Hence Rn{x) would vanish at least once in each of the 
2n intervals between successive points of this set, say at 
the points .Ti, all contained in an interval of 

length 2n. By Kolle’s theorem, Rn{x) would vanish between 
a’, and between 0*2 and.xg, etc., and also, as Rnixi + 2n) 
= = 0, between X 2 n and Xi + 27i; and the 2n distinct 

roots of Rn thus specified all lie within the interval from 
Xi to Xi-\-2n. Explicitly, 

Rhix) — — sin(na; — C) — Th{x). 

If C is chosen so as to make sinf/ix— C) equal to the 
negative of Tn{x) at a point where Tn{x) = ±1, i?n(:i^Xwill 
have a double root at this point; for each of the functions 
sin (nx~ C), Tn (x) separately attains a maximum or a minimum 
there. Having 2n distinct roots in a period, and at least 
one double root, R^ has roots of aggregate multiplicity at 
least 2n+ 1 . But this would require that as a trigonometric 
sum of order vanish identically, which is impossible, since 
Iinis sometimes positive and sometimes negative, by hypothesis, 
and so can not be constant. Consequently the assumption 
that 1 Tn (x) 1 remains less than 1/n is inadmissible. 

(The fact that a trigonometric sum of order n can not 
have roots of multiplicities aggregating more than 2n without 
vanishing identically is assumed here as known. A special 
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case of it, sufficient for the present application, is proved 
incidentally in another connection in Chapter IV below.) 

To proceed with the proof of convergence, let f{x) be 
a continuous function of period 27r, and Sn{x) the partial 
sum of its Fourier series to terms of order w, and let 

rm 

Yn = [/ W — Sn ix)Y dx . 

C/ ~7l 

Let tn{x) be a trigonometric sum of the nth order, sub- 
sequently to be chosen so as to furnish an approximation 
to fix), but arbitrary for the present, and let 

fix) tnix) ^ r(a;), ^%ix) — tnix) = tnix). 


Then r — t,i is identical with f—Sn, and 



Let €n be an upper bound for the absolute value of r(r): 


r ix) --- I fix) — tn {x) \< tn 


for all values of x. Finally, let iin be the maximum of 
! Tn ix) I , and xo a point where | (a:o) | = 

By Bernstein’s theorem. 

\Tnix)\ < nfln 


for all values ot x. For \x — ao|<l/(2n), consequently, 

I Tfi ix) In ixo) I ^ j j Tn ix) j fin • 

If bin'll (the contrary case being reserved for separate 
mention), 

I (»■) i l*n , I r (x) — Tn ix) I > ~ , 


throughout the interval specified. As the length of the 
interval is I In, 
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^(4") ’ 

whence iin < To suppose, on the other hand, that 

is to assume outright that So in either 

case fin does not exceed the larger of the quantities 4^^. 
4(w;'n)^^^, or, in a single formula, 

fin < 4(nrny/^ + 46n. 

But I In I f^ni \^'\~ ^n, uud r — Tn is the same as / — A';,. 
Consequently 

\f{x) — Sn (o:) I < 4 (n + 5fn 
for all values of x. 

The function f{x) being continuous, it is always possible 
to choose tn{x) for successive values of 7i in such a way 
that limn=oo^/i = 0. So the relation just obtained amounts 
to a proof of 

Theorem II. If f{x) is a continuous function of period 2n , 
its Fourier series will converge uniformly to the value f{x), 
provided that 

lim nyn — 0. 

n = oo 

Jn particular, Theorem VI of Chapter II yields at once 
Corollary I. A sufficient condition for uniform convergence 
is that f{x) be absolutely co 7 itinuous. 

By Theorem II of Chapter I, on the other hand, if « (d) 
is the modulus of continuity of fix), there will exist sums 
tnix) such that 

\fix) — tn (x) I < K' O) (27r/n), 
where K' is an absolute constant Then 

^ Jifix) — tn ix)Ydx [^'w (2 nj /i)]*. 

But as the integral takes on its minimum value when aS^ (^ ) 
is put in place of tnix), the right-hand member of the last 
relation is an upper bound for So 71 yn will approach 
zero if rY^nai^nl yi) approaches zero, and it is possible to state 


6 * 
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Corollary IL A sufficient condition for uniform convergence 
is that tvhere 00(6) is the modulus of 

continuity of f(x). 

The conditions of Corollaries I and II of course overlap 
to a considerable extent, but they are not coextensive, and 
neither includes the other. As previously mentioned, they 
are less general than others which have already been obtained ; 
absolute continuity is more stringent than limited variation, 
and the requirement involving 6^’- is more restrictive than 
the Lipschitz-Dini condition. 

The novel possibilities of the present method will be more 
apparent, if it is remarked that in the proof of Theorem II, 
down through the final relation of inequality preceding the 
statement of the theorem, no use whatever was made of the 
assumption that Sn{x) was the Fourier sum for fix), and 
the value of the integral Yh a minimum; the argument can 
be repeated step by step with Sn{x) replaced by Tnix), an 
arbitrary trigonometric sum of the t^th order, and Yn replaced 
by Qnj the integral of [fix) — Tn (^)]^ over a period. The 
conclusion is 

Theorem II a. If fix) is a continuous function of period 
2n^ Tnix) an arbitrary trigonometric sum of the nth order, 
and 

9n == JILV)- 

and if there exists a trigonometric smn tn ix), of the nth order, 
such that 

\f{x) — tn ix) I ‘ f n 

for all values of then, for all values of x, 

\fh)—Tnix)\ < 

To this may be added immediately 

Theorem Ilb. If sums Tn ix) are defined for infinitely 
many values of n in such a ivay that gn 1 : Ayh, where A is 
independent of n, the sums Tnix) will conva^ge uniformly to 
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the valm as n becomes infinite, under the conditions of 

Theorem \\ or either of its corollaries. 

The next application depends for its simplest expression on 
an existence theorem, the proof of which will be postponed 
for a few pages, to avoid interruption of the current order 
of ideas. The statement of the theorem is as follows: 

Theorem III. If q{x) is a summahle functiori of period 2 7i 
which is nowhere negative, and is different from zero om* 
a point set of positive measure in a period, if f{x) is a sum- 
mahle function of period 2Tt such that (>(:r)[/(x)]* is also 
summahle, and if Tn (x) is a trigonometric sum of specified 
order n, the integral 

[/(a) — Tu 

has a minimum,' which is attained for one and just one deter- 
mination of the coefficients in Tn {x). 

The truth of this assertion being assumed for the moment, 
attention will be directed to the question of the convergence 
of Tn (x) toward f{x) as n becomes infinite, the function fix) 
being supposed continuous. The problem is connected with 
an extensive theory developed from a different point of view 
by Stekloff, SzegO, and others. The minimizing sum may be 
called the approximating sum of or de)' n corresponding to the 
weight function p(^), and it will be understood now that 
Tnix) denotes this particular sum. The corresponding minimum 
value of the integral will be denoted by Gn- For the con- 
vergence proof, let q{x) be further restricted by the hypo- 
thesis that its values are comprised between two positive 
bounds: 

o<v ^ p(a:) t; 

where v and V are independent of x. 

By the minimizing property of Tn (x ) , 

On j ^ Q (:r) [f{x) — Tn (x)Y dx 
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where 6»(x) and have their previous meanings. On the 
other hand, with the notation of Theorem II a, 

(^n ^ V \ \f(pc)—Tn{oc)ydx ^ vgn^ 

So gn ^ {V/v)rny and the conclusions of Theorem II b are 
applicable : 

Theorem IV. If the weight function q {x) has a finite upper 
hound and a positive lowe)' bound, the approximating sum Tn{x) 
will converge uniformly toward f{x) as n hecmnes infinite, if 
luiin^oonYn = 0, and hence, in particular, if f{x) is absolutely 
continuous, or has a modulus of continuity such that 

lim^^o a)(d)/d^^2 — 0. 

In modification of this result, let the restriction |^(cr)i < 
be removed, the hypothesis with regard to q being merely 
that it is summable, and that ^ > i; > 0 everywhere. Since On 
is a minimum, 

On <^^q{x) [fix) — tn ix)ydx 

if inix) is any trigonometric sum of the nth order. Let Sn 
be an upper bound of \f — ^|, as before. Then 

with the notation ^ q — 1. As it is still true that On > vgn, 
it follows that gn < {I/v)e^. Under the hypothesis that 
lim^=o w ~ 0, <»(x) can be chosen so that limn=« 

= 0, which means that lim»=oon 5 r„= 0, and Theorem II a 
shows that Tn (x) converges uniformly to f[x) once more. 

3. Convergence of trigonometric approximation as 
related to integral of mth power of error 
To generalize in another direction, consider the integral 

Q (a:) \f{x) — (x) I*" d X , 

in which m is an arbitrary positive number, taking the place 
of the particular exponent m — 2'that has been used hitherto. 
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When m, w, ^(x), and f{x) are given, f{x) being bounded 
and measurable, it can be shown that there exists a deter- 
mination of the coefficients in Tn {x) for which the value of 
the integral is a minimum, provided that is summable, 
everywhere positive or zero, and different from zero over 
a point set of positive measure in a period. When m>l, 
the determination is unique. The existing presentations do 
not give the theorem with quite this degree of generality, 
but it is not difficult to supply the necessary extensions. For 
somewhat less general functions q (in particular, for the case 
that ^ = 1 identically) it has been shown that the deter- 
mination is unique for m = 1 also, if f(x) is continuous. 
It is not generally unique when m < 1 . The proofs will be 
omitted here, the facts with regard to convergence being 
formulated in such a way as not to presuppose a knowledge 
of the theorems of existence and uniqueness. 

For the discussion of convergence, let f{x) be a continuous 
function of period 2 tt, and let it be supposed that there is 
a number v such that 

Q ^ ^ ^ 

for all values of a:, while q{x)j as always, is summable over 
a period. Let 

g„ =--- i/(a?) — Tn (a;) \'^dx, 

Tn (x) being for the present an arbitrary trigonometric sum 
of the nth order. (The notation is changed somewhat from 
that previously used in treating the case m = 2.) Let it be 
assumed that there is a sum in (a:) , likewise of the nth order, 
such that 

1/(t) — <„(.r) ! < Sn 
for all values of x, and let 

f{x) — tn (a;) — r{x), Tn {x) — tn (x) = *•„ (x) , 
whence 


^ (a?) i r (a:) ta{x)\'^ dx = gn- 
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If /*» = I Tn (•i'o) I is the maximum of | r„ («) | , it follows that 
I 1 < n/*n, by Bernstein’s theorem once more, and there 
is an interval of length at least 1/n throughout which 

I *'» (3?) f n (Xo) I ^ I *■« (x) I > 

I »• (x) — (ar) I > 
provided that f« < \ /*„ . Then 

ffn > ~ . /tn < 4 (ngjvyi”^. 

Whether e„ < 1 Pn or not, 

< 4 (ngn/vY'”^ + 4 , 

and 

I / (x) — 7’» (x) I - (x) — (x) I < 4 (/i gJvY’”^+ 5 f„ . 

For each value of n, let Yn be the greatest lower bound 
of the integral gn, when all possible values are given to the 
coefficients in Tn{x), while the functions p and / and the 
exponent m are held fast. It is clear that rn ^ 0 . Nothing 
is assumed as to the possibility of making g^ actually equal 
to Yn> by one or more deteminations of the coefficients. 
As gn can not be less than y« for any sum of the nth order, 
it follows in particular, since tn(x) is such a sum, that 

Yn ^ (c) fix) — tn (x) |"‘ (lx s 

the value of J*p being denoted by I. If sums 7’„(x) are 
chosen for the successive values of n so that gn^ A Yn, where 
A is independent of w, then g^”' < (.•17)*’"*e», and 

/(x) — T«(x)| ^ 

where B is likewise independent of n . The sums Tn (x) wll 
converge uniformly toward fix), if can be made to 

approach zero, and this will be possible under conditions 
indicated by Theorem n of Chapter I, if m > 1 ; by the 
Corollary of Theorem IV of that chapter, if m = 1, and by 
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Theorem IV itself, if 0<wi<l. The principal results may 
be summarized in the following formal statement: 

Theorem V. If the weight function pCa;) is summaUe and 
has a positive lower bound, and if a sum Tn (x) of Hie nth 
order is chosen for each of an infinite set of values of n so 
that gn < Ayn, where A is independent of n, these sums will 
converge uniformly t<nvard fix), when m is held fast and n is 
allowed to become infinite, i/" m > 1 atvd fix) has a modulus 
of contimiity w(d) such that limj...oo»(d)/d‘'’" = 0, or if 
m — \ and fix) has a continuous derivative. 

The explicit statement for »i < 1 is more complicated and 
less interesting. ■ In the cases covered by the theorem as 
stated, for m > 1 , there is in fact a determinate approximating 
sum for each value of n, making gn~ Yn, and the conclusions 
of course apply in particular to the convergence of these 
approximating sums. On the other hand, it is clear that A 
can be replaced by a factor An which becomes infinite with n, 
if more restrictive hypotheses are placed on fix), so that 
can be made to approach zero with sufficiently increased 
rapidity. 


3. Proof of an existence theorem 

To carry out an intention expressed above, it remains to 
supply a proof of Theorem III, with regard to the existence 
of an approximating sum corresponding to a given weight 
function when m — 2. 

It will be worth while to begin with more general con- 
siderations. Let qoix), qiix), g»(a:), •••, be a sequence of 
functions, each summable together with its square over an 
interval (a, 6). Let these functions be linearly independent, 
in the sense that any linear combination of a finite number 
of them, with coefficients not all zero, is different from zero 
over a point set of positive measure in (a, 6), which is equivalent 
to saying that the integral of the square of such a linear 
combination, extended from a to b, is positive. The requirement 
may be indicated briefly by saj^ing that the q’s are properly 
independent. Let 
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fgiQo 

Qo ~ Qo 7 “ Q\ Qo 

/« 



it is understood that each integration is to be performed 
with regard to x, from a to b. Each denominator is different, 
from zero, because of the hypothesis of proper independence. 
It is seen from the definition of Qi that J*QoQi — 0. Since this 

is the case, the definition of Q 2 makes j QoQi = J Qi Qi 0. 
Generally, each Q is orthogonal to all the preceding Q’s, or, 
in other words, any two of the Q’s are orthogonal to each 
other. If 

Pk(x)^ 

the functions pk form a normalized orthogonal sequence, to 
which Theorem I is applicable. Each pk is a linear combina- 
tion of Qo, Qky and conversely each qk can be linearly 
expressed in terms of pk and the p's of lower order, since 
the coefficient of qk in the expression for pk is not zero. 

Now let q(x) be a non-negative summable function of 
period such that 

J ^q{x) dx >0, 

and let [Q(x)yf'^ w{x). The functions 

u){x)j w{x) cos.r, w{x) sin x, w{x) cos 2x, iv(x) sin 2x, • • • 

are summable, with their squares, and properly independent ; 
a linear combination of a finite number of them, with co- 
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efficients not all zero, is a trigonometric sum multiplied by 
^{x), and is different from zero wherever p 4= Cl; with the 
exception of a finite number of points at most in a period. 
If they are taken as the functions qo, Qi, qt, • ■ of the 
preceding paragraph, over the interval ( — n, n), the corre- 
.sponding functions Po,PuP», •■■■ have the form 

w (a;) Uo ix) , w {x) Ui{x), n; (x) F, (x) , w (x) Ut (x) , 
wix) FjCr), . • - , 


where IJk and Vk are trigonometric sums of the Hh order, 
and 


Uj{x) Uk(x) dx ~ V}(x) Vk(x)dx = 0 (; 4 

J^^q(x) Uj{x) Vk{x) dx: — 0 for all j and k, 
(x) [f/fc(x)]*cfx === [l^k (>»)]* (ix = 1. 


If T„(x) is any trigonometric sum of the «th order, wix) Tnix) 
is a linear combination of the functions now taken as the q's. 
and so is a linear combination of the functions w{x) Uk(x), 
w(x) Ffc(x), k = 0, I ,■■■, n, and T„ itself is expressible in 
terms of Uk, Tk, with the same coefficients. 

Let /(x) be a summable function of period 2 /t , such that 
^( 33 ) [/(a*)]* is also summable. Then 

[fix)— -Tnix)]* dx = J^^[m;(x)/(x) — /(’(x) T„(,t)]* dx, 

and Theorem I can be applied, with wix) fix) as the function 
for which an approximation is sought. The integi'al has 
a minimum value, and if 


Tnix) = .i, (X) + L\ (x) + . . . + An Unix) 

+ B, \\ix)+ ...+P„F„(.t), 

it is necessary and sufficient for the attainment of the minimum 
that 
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Au == w (a;) f(x) • w (a?) Uu (x) dx = q (x) f{x)Ukix)dx, 
Bk ^£jix)fix) Vk{x) dx. 


4. Polynomial approximation 


Attention will next be directed to problems of polynomial 
approximation. An essential instrument here is the form of 
Bernstein’s theorem in which Bernstein himself was primarily 
interested : 

Bernstein’s Theorem for Polynomials. If Pn (oc) is 
a polynomial of the nth degree^ and L the maximum of 
I ^n(^) i for — 1 ^ rr ^ 1 , then 



nL 


for — 1 << x <; 1 . 

It can be deduced immediately from the theorem for the 
trigonometric case. Let a: == cos Then Pn{x) ~ Pn (cos d) 
is a trigonometric sum of the wth order in By having L for 
the maximum of its absolute value, and consequently 


■^P„(cosfl)| = I sin OP;! (cose) I = |(1— xT' P„'(a)! < nL, 


whence \Pn{x)\ <nLI{\ — 

If L is the maximum of [PnCx^l for a <x < by let 

2x — a — 1) 


Then Pn{x) ^ Qniy) is a polynomial of the nth degree in y, 
having L for the maximum of its absolute value in the 
interval — 1 ^y ^ 1. Therefore 



Pn{x)- 


dx 

dy 



Pn{x)\ < 


11 L 

(1 ’ 


by substitution of the value of y in terms of x in the factors 
(1 — y)y (l + y) under the radical sign, one is led to 

Corollary I. If Pn{x) is a polynomial of the nth degree y 
and L the maonimum of Pn{x)\ for a <x <1), then 
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l^n(:^)| < T/i:- 


nL 


[(h — x) (or — 


for a<x<Cb, 

This implies directly 

Corollary II. If Pni^) a polynomial of the nth degree, 
and L the maximum of \Pn{x)\ for u < x < b, and if 
a a\<,bi<by then 


< hnh 


for ai^ X hi, whe^'e h depends only on a, ai, hi, and b. 

A further corollary is to be obtained by the followinp: 
considerations. Let Pn{^) once more be a polynomial of 
the nth degree, and L the maximum of its absolute value 
for — 1 % X < 1 . Let x be a number of the interval 

— I X < 1, and d a positive number <1 such that 

— l<ir — d‘"l. Then 

i - J) - £_,KWdr< . 

It is readily seen that 

Ji ^ (1 — r-)‘- • 

^Por an analytical proof, let d - 2//, .r — ~ y- so that 
p" dt ’ dt _ 

Jy-r, 

Since 

_(/ dt 1 _ _ 1 


which is equal to 0 if // = 0 , positive if ^^> 0 , negative if 
y '-O, the value of the integral is a minimum when y 0. 
that is, when the interval of integi'ation is symmetric about 
the origin, and increases steadily as the inten’al is displaced 
toward either side.) As d < 1 , 0 < < <{ 1 in the integi’al 
from 1 — d to 1, 1 — /*>! — < >0, and 
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So 


r dt ^ r* dt 




!P„(a;) — P„(a; — d)| < 2nLd^l^. 


An equivalent statement is that if Xi and Xi are two numbers 
in ( — 1, 1), differing by not more than unity, 


Pn(oCi) — Pn{xt) \ < 2nL\x2 — Xi\^i^. 


Now suppose that L is the maximum of | Pn (x) | for 
a^x<bj and let Xi and x^ be two numbers of this interval, dif- 
fering by not more than ^{b — a). Let y ^{2 x — a — b)/{b — a) , 
the values of y corresponding to xi and x^ being yi and y^j 
and let Pn{x) = Qn(y)* Then L is the maximum of \Qn(y) 
for — ^ Sy = yi y^ ^WO numbers of this interval, 

differing by not more than unity, and 

\Pn{x2)-’Pn{xi)' = i Qn(y2) — Qn(^l) ! 2nL i 1/2 — 1/1 1^^^ 

-- 2nL[2\x2 — x^\/{b — a)yf\ 

The conclusion is 

Corollary III. If Pn (a:) is a polynomial of the nth degree^ 
and L the maximum of | Pn(x) \ for a <x <b, and if Xi 
and Xf are two numbers in the interval, dijfering by not more 
than \{h — a), then 

\ Pn {x^ — Pn (xi) \ < HnL\x 2 ~Xi 
where H - 2l[\{b — a)Y\ 

By the introduction of an intermediate value xv^ = \ {xi + x^), 
it may be seen that 

|Pn(x2)-P„(a:i)| < 

without the restriction that \xt — Xx \ < \ {b — a). 

The way has now been prepared for a discussion of 
problems of approximation associated with the integral 

Q (^) l/(a;) — Pn (x) I”* da: , 
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where q{x) and fix) are given functions in the interval 
(a, b)y Pn{x) is a polynomial of the wth degree, and m is 
a given exponent >0. 

When m = 2, a theorem of existence and uniqueness may 
be stated as follows: 

Theorem VI. If Qix) is a siimmahle function which is 
nowhere negative, and is different from zero over a point set 
of positive measure in the interval (a, h), if fix) is a siimmahle 
function such that (>(:^) [/(^)]* is also siimmahle from atoh, 
and if Pnix) is a polynomial of specified degree n, the integral 

f Q (a:) [/(a-) — Pnix)]* d x 

has a minimum^ which is attained for one and just one de- 
termination of the coefficients in Pnix). 

The proof is entirely analogous to that of Theorem III. 
The functions 

wix), xw{x)j x^wixf •••, 


where wix) ~ are summable together with their 

squares and properly independent over the interval (a, h). 
If they are taken as the functions Qkix) previously considered, 
the corresponding functions pkix) have the form wix) Ykix), 
where Ykix) is a polynomial of the kt\i degree, and 

VQix)Yjix)Ykix)dx - 0 O' + h), 

pj (I 

^^qix)[Ykix)Ydx 1. 

The polynomials Ykix) are called the Tchehychef polynomials 
corresponding to the characteristic function qix) in the intei- 
val ia, h). Any polynomial Pn(^), of the wth degree, can be 
expressed as a linear combination of Fo, Fi,***, Fn, and 
for the minimizing of the integral in the statement of the 
theorem it is necessary and sufficient that the coefficient of 
Yk in this expression be 

Wix) f{x) Yuix) dx. 
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In the case that q is identically equal to 1 and the interval 
is that from — 1 to +1, Yk{x) is e^ual to {k Xk{x) , 
where Xk is the Legendre polynomial of the kth degree ; and 
the approximating polynomial Pnix) then is the partial sum 
of the Legendre series for f{x). 

For an arbitrary value of m>0, theorems of existence 
and uniqueness can again be established in analogy with those 
of the trigonometric case. An approximating polynomial, 
reducing the integral to a minimum, exists if f{x) is bounded 
and measurable, and if is summable, nowhere negative, 
and positive over a point set of positive measure in (a, 6). 
The approximating polynomial is unique when m > 1 , and, 
with suitably restricted hypotheses, when m --- 1 also. The 
proofs will not be given here, the further discussion being 
put in such a form that a knowledge of them is not essential. 

Let f{x) be a continuous function for a ^,x< h, and Pn{x) 
an arbitrary polynomial of the ?ith degree. Let be 

summable over (a, i), and nowhere negative in the interval, 
and let q {x) > > 0 for «o ^ ^ where a ^ wo < 

(It will be noticed, as a departure from the conditions of 
the earlier convergence proofs, that the hypothesis (> ^^ > 0 

does not necessarily hold over the entire range of variation 
of X.) Let 

gn Q{x) \f{x) — Pn{x) (lx. 

Furthermore, let ptiix) be a polynomial of the 7ith degree, 
in general distinct from Pnig^y 

\f{^^ — Vn{x)\ < fn 

for a ^ x h, and 


f{x)—pn{x) 
so that 


q{x) \r{x) 


Pn{x) — pn{x) 


i{x)\;^ (lx = gn- 


Suppose that pn is the maximum of \ 7Tn{x) \ for ccq < x 
and that Xo is a point in («o, M at which this maximum is 
attained. By Corollary III above. 
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|7*»(x) — nn{xo)\ < Hn/tn x — 

as long as x is in (ag, fio) and distant from by not more than 
i (/®o — «o), the factor H being equal to 2/[i (/Sg — «g)]^'*. These 
conditions will be satisfied for n < ' 1 throughout one or the 
other (or both) of the intervals (srg — d, a;g), (:rg,a-g + d), if 

i 1 ~ <*g 

“ ^H*n* ~ '32 «* • 


Over such an interval, then, since ja — a-gl^'-* < 1/(2 ifn). 


|7r„(a-) — 7r„(a:g)j < 

If fa < \ltn, 


1 

2 


|r(a-) — Xn(x)\ 


l/«'rt(a‘)i > 




over the same range; and, as the length of the interval is 
l/(4.ff*«*). 




V 

4^*«* 





with Bn = 4(4/ir*/j;)''”'. Whether f„ < or not. 


(la < So + 4 e„, 

\f{x) — i-’„(x)i !r(.r)— .T„(ir)' < Boin*f/„y”'+bf„ 


throughout (og, /^g). 

Let rn be the greatest lower bound of y„, when six), fix), 
and M! are given. As an immediate consequence of the 
definition, 

ra ?(a.)i/(a;) — p^ix)\”^ dx ^ fcj*. I = Qi;x)dx. 

If polynomials Pn (ar) are chosen for an infinite succession of 
values of n so that gn ^ Ayh, where A is independent of «, 
then g)l”* < iAiyi”*e^, and 

|/(a;) — P„(x)| ^ 
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for ccq ^ X < fioj with a factor B which is likewise inde- 
pendent of n. 

The conditions to be imposed on fix), in order that it may 
be possible to make approach zero, are to be obtained 

from Theorems VI and VJII of Chapter I, and the Corollary 
of the latter. It will be sufficient to state the results form- 
ally for m>2, though corresponding statements could readily 
be added for 0 < m < 2 : 

Theorem VII. If q{x) is summable ovei' {a, h), and has 
a positive lower hound for ao ^ ^ ^ t ivhere a ^ «o < 

^ h, and if a polynomial Pnix) is chosen for eacJt of an 
infinite set of values ofn so that gn Ayu, where A is inde- 
pendent of n, these polynomials icill converge uniformly toward 
/(^) f^^ ^0 ^ ^ fioi adien m is held fast and n is allowed 

to become infinite, if m'> 2 and fix) has a modulus of con- 
tinnity a)(d) such that lim^ - q - 0, or if m = 2 

and fix) has a contmuous derivative. 

Under the conditions formulated, there will in fact be one 
and just one polynomial of each degree n for which gn = Ynj 
not merely for m = 2, in accordance with Theorem VI, but 
also for an arbitrary m>2, under a theorem cited above 
without proof. Theorem VII naturally applies to the con- 
vergence of these approximating polynomials. More particular- 
ly still, for (a:) = 1 , a = — 1 , h 1 , one may state 

Corollary I. If fix) has a continuous derivative for 
— 1 ^ a; ^ 1 , its Legendre series converges unifoy mly toward 
fix) throughout the dosed interval. 

Another part of the content of the theorem, aside from the 
question of uniformity of convergence, is expressed in 

Corollary II. If is continuous and nowhere negative 
for a ^ x ^ b, the hypotheses of Theor'em VII remaining 
otherwise unchanged, the polynomials Pn ix) will converge toward 
fix) at all points where gix) 4 0. 

Additional information can be gained by further elaboration 
of the proof of the theorem. Let the previous notation be 
continued in force, . with the understanding now that poly- 
nomials Pnix) and pnix) are defined for all positive integial 
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values of w, and that these sequences of polynomials are 
kept unchanged throughout the course of the reasoning. Let 
a and ^ be two numbers such that Let 

••• and be two infinite successions of 

numbers such that 

ao<ai<a8'^ ••• <«, /?< ••• 

Let Vn be the maximum of \nn{x)\ for « < a: < and (ink 
the maximum of the same function for ak<x<fik; t^njo is 
the same as the previous It follows immediately from 
the definitions that 

M'ttO M'nl ^ l^n2 ^ • 

Until the contrary case is explicitly mentioned, it will be 
assumed that Then, a fortiori^ and 

^ jBo(w* by the earlier work. If gn ^ Ayn, as will 

be assumed from now on, g]l^ does not exceed a constant 
multiple of and 

/^no ^0 Bn } 

where Aq is independent of n. 

Suppose it is known, for a specified value of that 

ZOi/c < AkU^ frn, 

Ak being independent of n. The exponent will be denoted 
by cjc when there is occasion to emphasize its dependence 
on k. By Corollary II of Bernstein's theorem, 

(^) j h Hfink 

for < x<fikJ(-\i the factor h depending on fikj and 
but not on anything else. Let .Ti be a point in (offc+i, fik+\) 
such that |nr„(xi)| = and let di == ii*n,/c+i/(2AMit^nfc). 

Because of the fact that tin,k+i ^f*hk, ^ ll{2hn), and it 
is certain that at least one of the intervals {xi — di,xi), 
{xi , a;i+ di) consists entirely of points belonging to (afe+i , 
as soon as n is sufficiently large. This condition being satisfied. 
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,7i«(a;) — 7r„(a'i;i ^ ^ y /‘n.fcfi, 


throughout the interval designated, and as it has been supposed 
that ^ J Vn ^ 1 (*n, fc+l , 


|r(3-) — 7i„(a;)l ^ .9» ^ ^ 4 ^"^ 


.fc+l 


■ 2 htlftfnk 




where Bk+i is independent of n. Taken with the relations 

this means that 


or, if Bk+i(.AAic is denoted by 

/»n,k+l < Ak+i 


Because of a qualification inti’oduced at one stage in the 
work, the result has been obtained in the first instance only 
for values of n from a certain point on ; but it can be extended 
to the finite number of values of n previously neglected, from 
« = 1 on, by a suitable adjustment of the value of ^k+i . 
It is recognized therefore that the successive numbers /»»* have 
upper bounds involving a sequence of exponents ofe , beginning 
with oo = 2 /m, and so related that ffk^-i-' ('^k+l)/(wi+l). 

It may readily be verified by induction that 


tfk- + 


m 


__ 1 

m (m + i)* 


If 1] is any positive number, it will be possible to choose 
a value of k for which tffc<(l/«i) + 7. With such a value 
of k, 


All this, it may now be recalled, is on the assumption that 
irn. But in any case 

P). < 4 «n+ Ak «„ . 

The last relation, by the definitions of and fn, implies 
that 
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— Pn{x) \ = \r{x) — 7I„{x)\ ^ 
for a < x^ /i. More concisely, since 1 for « ^ 1 , 

\f(x)-Pn{x)\ <: 

if C A/c-i- 5, the factor C, like those which have preceded 
it in similar situations, being independent of n . The previous 
results with regard to convergence may be supplemented by 
stating: 

Theorem VIII. Jf q (aj) and 1\ (x) satisfy the hypotheses 
of Theorem VII, and if «o< a < He fig , the polynomials P„ (a:) 
will converge uniformly toivard f{x) for if m>'[, 

and if a positive number exists such that 

Urn o) (d)/d<v»>+’? = 0, 

i )=0 

where mid) is the modulus of continuity of fix). 

Corollary. If qix) is continuous and nowhere negative for 
hypotheses with regard to Pn ix) and fix) 
remaining unchanged, the polynomials Pnix) will converge 
toward f ix) at all points where g (a:) 0 . 

For points in the interior of (a, h), this corollary super- 
sedes the second corollary of Theorem VII, being a direct 
generalization of it. 

There are analogous but less simple conclusions for 
0-^w 1. 

Generalizations of Bernstein’s theorem (in appropriately 
modifled fom), leading to the extension of parts of the pre- 
ceding analysis to certain cases of development in terms of 
characteristic functions of diifercntial systems, have been 
given by Miss Carlson (Transactions of the American Mathe- 
matical Society, vol.26 (1924), pp. 230-240, and vol. 28 (1926). 
pp. 43.5-447). 

5. Polynomial approximation over an infinite 
interval 

Another extension of the scope of the method lies in its 
application to problems of polynomial approximation over an 



102 


THE THEORY OF APPROXIMATION 


infinite interval. To be specific, let / (a:) be a function defined 
and continuous for all real values of x, and not greater in 
absolute value than a constant multiple of a power of a;, as 
X becomes infinite in either direction. Among all polynomials 
of specified degree n, let Pn(x) be the one which minimizes 
the integral 

l-oo 

The proof of the existence of the minimum and the uniqueness 
of the corresponding polynomial offers no new difficulties in 
comparison with cases already treated, and will be omitted. (It 
can be based either on the construction of a sequence of orthog- 
onal polynomials over the infinite interval with the weight- 
function , essentially the polynomials of Tchebychef-Hermite, 
or on the fundamental theorems of real analysis. From the 
latter point of view, the essential point in the passage to the 
infinite interval is merely that if any coefficient in Pn(.r) were 
large, the integral over a finite interval would be large, and the 
integral over the infinite interval would be larger still, so that 
the coefficients to be considered in the search for a minimum 
belong to a bounded domain.) It is to be shown that under 
suitable restrictions on /te) the polynomial Pn {x) will converge 
everywhere to / (:r) as n becomes infinite, and will converge 
uniformly over any interval of finite extent. The aim will 
be to arrive at a result of this character in as straight- 
forward a manner as possible, without regard for the greatest 
attainable generality. 

Let Yn be the minimum value of the integral, and let « 
be an arbitrary preassigned positive number. Let pn{x) be 
an arbitrary polynomial of the nth degree, let 

rn{x) = f{x)~pn{x)y nn{x) = Pn{x) — pn(x), 

SO that 

f(x) — Pn{x) = rn{x) — Unix), 

and let en be the maximum of \rn{x) \ for — («+l)<a:^«+L 
Let Pn be the maximum of \nn{x)\ in the same interval, 
attained for x — x^. 
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By the third corollary attached to Bernstein’s theorem on 
the derivative of a polynomial, there is a positive number H, 
independent of n, such that 

|/rn(a;) — 7rn(a;o)i Hnfin\x — 


as long as x is in that one of the intervals ( — a — 1,0), 
(0, a + 1), to which x^ belongs. This condition will be 
satisfied, on one side of xo at least, for all values of n from 
a certain point on, if |a: — a’ol ^ l/(4^r*n*). So there will 
be an interval of length throughout which 




Trn{x)\ 



If it is supposed that f„ < \fin, then 

' fix) — Pn (a-) I = I r„ ta-) — ( x ) ^ ^ 


throughout the same interval, whence it follows that 


yn > 




16 4H* 


n‘ 


< c^ny 


1/2 


where c, is independent of n. The alternative possibility 
that *n > i may be left out of account until a later stage. 

Now let th, be the maximum of 1 /rn (a;) | for — a < x < a. 
It is certain that /a', Let xi be a point in ( — «, «) 

such that |7rn(a:i) 1 = nh. By the second corollary of Bern- 
stein’s theorem for polynomials, there is an h independent 
of n such that 


' Ji»(x)i < hnin„, j n„{x) — 7r„(a;i)| < hnttn \x—Xi |, 

for — « < .r < a . (It is clear that h must be positive.) The 
quantity dn = ihiK^hniir^ is less than a for values of n from 
a certain point on, since One or the other of 

the intervals (xi — (a:i, a::i+<Jn) is then contained in 

( — a), and there is an interval of length at least dn through- 
out which 

! 7T„ (.r) — 7r„ (n) I < (t) ■ > ft'n . 
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If *n ~ i /<■» » 

I/O 

in this interval, and 


1 /( 0 !) — P„ (a:) 1 = |r»(a;) — T„(a;)l > ^ 




with C 2 independent of n . The assumption that ^ i cames 
with it the condition < i/*„, under which 
so that 

If the hypothesis that is not fulfilled, 

immediately. In either case 

+4f„, 

with the reservation merely that a finite number of values 
of n may have had to be ruled out in the course of the proof. 

As and fn are upper bounds for iTinCa;)! and lr„(x)i 
respectively, 

i/(r) - P„(.i) = i rn{x)-n„ (x) £ + 5t„ 

for — For fixed «, the polynomials i>»(a) can 
surely be determined so that limn- « «n — 0, since f{x) is 
continuous. In order that P»(x) shall converge totvard fix), 
uniformly throughout the interval ( — «, a), whatever value 
may he assigned to a, it is sufficient that lim»^!««^^*yn == 0. 
It remains to discuss the order of magnitude of yn, under 
appropriate hypotheses on f(x). 

There will be occasion to use a lemma regarding the 
behavior of a polynomial as its argument becomes infinite. 
Let pn(x) be an arbitrary polynomial of the Jith degree, 
n > 1 , and let M be an upper bound for its absolute value 
in the interval — 1 1. Let Ak(x), as in Chapter I, 

be the Legendre polynomial of degree k. Then p»(x) may 
be written identically in the form 

p„(x) ^MXo(x)-hAiXi(x)+ ... +A„Xnix), 
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where 


Ak ^ ^ ^ \ ^pn{x) Xk{x)dx, 


From the last equation, together with the definition of M and 
the fact that \Xk{x)\ <_ \ in ( — 1, 1), it follows that 


Ak\<{2k^l)M. 


For 1 x‘ I ^ 1 , on the other hand, the identity 


gives 


Afc (./•) “ ^ j \x-\-{o(? — 1)^^'^ cos (pY d y 
71 Jo 

|X/,(.r)| ^ |2:r|^ 


since (a:* — 1)^^“ |cosy | ^ \x\. Consequently, under the assump- 
tion still that I a: I > I, 

\pn{x)\ ^ M\l + S\2x^ + b^2x\^+ \-{2n + l)\2x\^] 

< Af[l + 3 + 5+ +(2r2 + l)]|2xh 

-- (n + iyM\2x\^<{n-\-7iyM\2x\^ 4ilfn*i2a*h 


If pn (x) is a polynomial of the nth degree, as before, and 
if M is an upper bound for \pn{x)\ in an interval — fi^x 
< fiy where /J is an arbitrary positive number, then Pn{x) 
is at the same time a polynomial of the nth degree in xlfi, 
and for \x\ ^ /9, \x/fi\ ^ 1, 

l2?n(a:)| ^ 4ilfn*|2r//^|”. 

This relation, under the hypotheses stated, constitutes the 
lemma in question.. 

Let it he assumed now that f(x) has everywhere a first 
derivative satisfying the condition 

\f'{Xt)—f'{Xi)\ < i.]Xi — Xi\. 

Let /S be an arbitrary positive number, and let Theorem VII 
of Chapter I be applied to the approximate representation 
of f(x) in the interval ( — fi, /t),. The conclusion is that there 
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exists for each a polynomial pn(x), of the nth degree, 

such that 




2K*JU 

n* 


for — /S < a: ^ /d, where K is the absolute constant that 
figures in Theorem I of the first chapter. 

As an immediate consequence, 


m 



[f{x) ~ pn(x)Ydx < 




C 

~ n* J 


dx, 


or, as the value of the last integral is n' - 


m < 


8jrv*A* 


2 . 


Furthermore, IpnW! ^ for — 

if Ml is the maximuin of in the interval. Let |/(0)| 

= do, l/'( 0 )l =" ai; then \f'(x)\ < ai-\-).\x'. '/(.r)| 

< «o + «i !ic! + i Aa;*, and 

Ml < oo + «./«+ -2 


(Incidentally it is seen that the present hypothesis carries 
with it the fulfillment of the earlier requirement that f{x) 
shall not become infinite faster than a power of x.) If ^ > 1 , 
as will be .assumed henceforth. Mi ^ (oo + «! + i and 


;>»(.'') I o/S* 


for — < X ^ where C 4 is independent of fi and of n, 
though it is different for different functions /(a;) . 

For I a; I ^ by the reasoning of an earlier paragraph, 

At the same time (that is, for la^l ^ ^ 1) 

i/(a')l -fC <e«4- «i Is"! 4- < |oo + «i + -2 
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and b}' further enlargement of the right-hand member, with 
«* == for abbreviation, 

I /(a) I ^ «4 a;* — ^ at fi* {2x1 fi)* < n*{2 xl $)* 

it has already been assumed that n ^ 2 . So, with a new 
multiplier ca, independent of $ and of n, 

l/(a:) — Pn (x) I ^ ^4 c,-f Osjid* n* | 2 x//i I", 

{fix) — Pn («)]* < Cifi*n*:2 x/fi 1“*'* 
for \x \ > i®. 

This means that 

p — p„ix)Ydx <; x?’^e-^dx\ 


the quantity on the right is an upper bound also for the 
integral from — oo to — fi. Since /® > 1, 



ar"fi ^dx<' 



x dx 


1 

•> 


/4CC 

y" (>-" dy 


''tX ^ I ^’(h + 1) 


1 

2 


« (k — 1 1! 


Without reference to more elaborate approximations to tlie 
value of the factorial, it is seen at once that 

pfc+i 

log/f — logfeiff-^J^ logfdf, A=l,2. •••, 

log[(n — 1)!] -- logl-j-log2H hlog(w — 1) 

< log < df -= n log n — + 1 . 

(n — 1)! <e(w/e)’'. 

Consequently 
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and 


On = L/Xa ) — On (a;)]* dx 


< 


8 fi* 
n* 


+ Cse/S*«® 



The number has been arbitrary hitherto, except for the 
i-equirement that it be positive and not less than 1 ; the last 
relation is valid with the insertion of any such value of 
Xotc let — 2 n^i "- . This makes 

//»< i 1- IGrjew’e 

n 

As jt* e~“ approaches zero when n becomes infinite, n’ does 
not exceed a constant multiple of 1/n* for w ^ 2, while, on 
the other hand, On is not less than the minimum value yn . 
Therefore 

J’" J?" = “* > 

with cg independent of n . 

The condition limn=<» n*>^yn = 0 being satisfied, Pn(x) cm- 
vet'oes uniformly toward f(x) thronghont any finite interval, 
under the hypotheses stated. 

As in the case of the Fourier and Legendre series, the method 
under discussion, when applied merely to the classical problem, 
yields little if anjrthing that is new, and misses much that is 
well known. Clearly, however, the treatment admits a variety 
of generalizations, which remain open for further investigation. 
To mention just one, which calls for no additional labor, the 
reasoning applies without material change if the weight func- 
tion e is replaced by any positive continuous function which 
is never greater than a constant multiple of The theorj' 
thus suggested has been developed at some length in a thesis, 
as yet unpubli.shed, by J. M. Earl. Theorems on degree of 
approximation over an infinite interval, without reference to 
the particular method of this chapter, have been published 
by W. E. Milne in vol. Sl of the Transactions of the American 
Mathematical Society. 



CHAPTER IV 


TRIGONOMETRIC INTERPOLATION 

I. Fundamental formulas of trigonometric 
interpolation 

This chapter is concerned with certain stnking analogies, 
both formal and analytical, between the theory of interpolation 
by means of trigonometric sums and that of Fourier series. 
The case of polynomial interpolation will be left out of con- 
sideration for the most part, since the analogies there, when 
the points used for interpolating are equally spaced, ai’e 
rather with Taylor s series than with those of Fourier and 
Legendre. Certain extensions to the case of interpolation 
by means of Sturm-Liouville sums have been carried through 
by C. M. .Tensen (Transactions of the American Mathematical 
Society, vol. 29 (1927), pp. 54 -79). 

Let foj fi) • • •) f 2 » be 2n-fl distinct numbers contained in 
an interval of length 27t, for definiteness that from 0 to 2/r, 
inclusive of the left-hand and exclusive of the right-hand end 
point. Let yo, yi, • • ym be any 2n + l real numbers, 
distinct or not. Let the problem be proposed of determining 
a trigonometric sum Tn ix), of the nth order, to satisfy the 
conditions 

r»(fr)-=yr. »• = 0, 1, •••, 2n. 

The sum Tn(,x) has 2n-t-l coefficients, on which 2n-\-\ 
conditions are imposed. With the notation 

Tn (x) Uo -f Oi cos a: -1- • • • + a,, cos n j- 
+ fti sin a: -f • • • + i)n sin tix, 

the conditions to be satisfied are given explicitly by the 
2n-{-l equations 
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Oo -}” fli COS ^0 • H~ sin fo ~l~ • • * ttncosn^o “f-i>nsin7t^ 2/o j 
6io ”1" cos •-|"^isin^i -j" • • * "f* ^ hCos - hbusinw^i — i/i , 

(to + Ui cos lin + sin ^2n + h an cos 71 t2n + &n sin n t2n ~ y2n , 

which are linear in the a’s and 6’s. One is confronted by 
the problem of showing, directly or indirectly, that the deter- 
minant of these equations, the determinant 


1 

cos <0 

sin ^ 

cos nto 

sin nU 

1 

cos 

sin ti 

cos ntx 

sin n tx 

1 

cos 

sin hn • • • 

cos n tin 

sin nt 2 \ 


is different from zero. 

If a and are any two numbers, the product 
sin ^{x— a) sin ^ {x — $) 

is a trigonometric sum of the first order in x’, being identically 
equal to 



= Aq-\- Ax cos X “h sin X , 


with Ao-= i cos (i a — i /J), Ax^ — i cos (i a + i /«) , 
Bi— — ^ sin a + ^ . In the expression 

^k{x) = 

sin ^ (x •— <o) » . » sin \(:x— tk-i) sin ^{x— tkJri) ■ - sin ^ (x -- fen) 
sin i {tk— • sin i {tk— tk-i) sin \ {h— tk-^i) • • • sin i (tk ~ hn) ' 

the 2 n factors of the numerator can be combined in pairs 
to give n expressions, each of which is a trigonometric sum 
of the first order, and the product of these is a trigonometric 
sum of the wth order ; the denominator is a constant which 
is not zero, since the difference of any two of the numbers U 
is by hypothesis different from zero and numerically less than 
2 71 . It is apparent on inspection that = 0 for r fc, 

while yfc(ffc) = 1. Consequently the function Tn{x) defined hy 
the formula 
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2il 

k=0 

is a trigonometric sum of the nth ordei- such that Tn{tr) — yr, 
~ 0, 1, . • 2 n. The linear equations above accordingly 

have at least one solution, for any assigned values of the y’s. 
But if the determinant of the equations were zero, there would 
be values of the right-hand members for which there would 
be no solution. Jt is certain therefore that the determinant 
is different from zero. Whatever values arc given to the y’s, 
the proposed problem has one and just one solution. In par- 
ticular, if -all the y’s are zero, the obvious solution in which 
all the a’s and b's are zero is the only one. A trigonometric 
sum of the nth order which vanishes «< 2 n + 1 distinct points 
in a period is identically zero; tiro trigonometric sums of the 
nth order which coincide in value uf 2 n + 1 distinct points of 
a period are identically equal. 

It is easy now to supply a proof of a fact which was pre- 
viously assumed as known, in connection with the proof of 
Bernstein’s theorem, namely that a trigonometric sum of the 
nth order vanishes identically if it has 2 n distinct roots in 
a period, one of which is a double root. Let T,, (x) be a sum 
of the nth order which takes on the value 0 for 2 n distinct 
points of a period, say for .r -- h, U, ■ ... t-m, but which does 
not vanish identically. Let U be a point of the same period 
interval, distinct from h, •••, t-m. and let yo — it is 

certain that yo + 0, since Tn{x) can not have 2 n-f- 1 distinct 
roots in the inteiwal. The expression 

sin ^ (x — <i)sin ^ (x — U) • • • si n ^ (x — 
sin i (<o — k) sin i «o — U)--- sin i «o — t^n) 

s a trigonometric sum of the »ith order which takes on the 
value yo for x = fo, and vanishes for x =- ti, • • - , tin. and so 
must be identical with r»(x). The derivative of this expression 
is easily calculated explicitly; it is 
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For X = tr, if r has any of the values 1, • • •, 2 m, all the 
terms of the summation vanish, except the one for which 
/.• =: r, and that becomes 

__ sin — f i) • • • 1 • • • sinK <r— fen) 

sin^(«o — h) ••• sin ^(< 0 — tr) ••• sinK<o — tzn) ‘ 

which is certainly different from zero, so that Tn (tr) = i yo Z>r + 0 ; 
the sum Tn(x), assumed not to vanish identically, can not 
have a double root at any of the points ti, •••, tm- 
To return to the problem of interpolation, M the points U 
from now on be supposed equally spaced over a period.' As 
a matter of general notation, if m is any positive integer 
and r any real integer, positive, negative, or zero, let 

tr = 2r7ilm. 

The discussion hitherto (apart from the digression of the last 
paragraph) is applicable on the assumption that m is odd, 
m — 2m-|- 1; there will be occasion subsequently to consider 
even values of m as well. Throughout the remainder of this 
chapter^ in the absence of express indication to the contrary, 
the sign 2 understood to refer to summation with 

respect to the index r, from r 0 to r — m — 1, or what 

comes to the same thing, as all the functions considered will 
be of period 2rt, over any m successive values of r. 

It is a fundamental fact, whether m is odd or even, that 

^ sin k tr — 0 

if A: is any integer, and that 

y] cos ktr — 0 

if Ac is an integer not divisible by m , while if /c is a multiple 
of m (and in particular if Ac — 0) it is evident that 

yi cos ktr == m, 

since in this case each term of the summation is equal to 1. 
The truth of the statement with regard to ^ sin Ac <r is 
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apparent from considerations of symmetry. The term sin fcfois 
equal to 0. Since 

k tni—r 2 k {m — r)7E/ m = - 2k n — k tr , 

eiirli term corresponding to an index r between 0 and 
is paired with a numerically equal term of opposite sign. 
If m is odd, all the terms are thus accounted for; if m is even, 
the term which remains by itself is sin ktmi^ sin fc/r = 0. 
Ill connection with the cosine sum it is to be recalled that 

} + cos u “f cos 2 M + • • • + cos n u = — - , 

^ 2 sin i 1 / 

(The' right-hand member is understood to be defined by con- 
tinuity at points where the denominator vanishes.) If u = , 

then nt ~ ktr. and 

\ + cos k ti + cos k ^2 + • • • + cos k fn = . 

2 2siniA‘ti 

But by the symmetry pointed out in the preceding paragraph, 

the terms on the left are respectively the same as those of 

the sum 

^ cos ktm I ~t" cos k (m -2 COS k fni n • 

If m — 2?i+l, and if k is not divisible by m, addition of 
the two sums gives 

sin(n-f^i)^^ 
sin^/vti ’ 

and in the last expression the numerator is zero, since 
— 27i/m and n + i = .Jm, so that (n + i)7t^ ^ kn. while 
the denominator is different from zero. If m = 2n, the tenn 
cos ftfn = cos occurs in each sum; combination of the 
two, with subtraction of the redundant term, gives 

V* I . sin(N + i)kti 1 . 

cos ktr ^ . v-f - — cos nkti 

" smi/tf, 

sinwfc^i cos ^fc^i+cos?ifc7, sinifr/i 

Sin ikfi 

— - sin n/cfi cot -^^-fc/, = sinfr/r cot(A:7r/m) = 0, 
the assumption being still that A- is not divisible by m. 


y cos A tr =-' 
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The proposition of which the preceding lines give an 
analytical proof is almost evident geometrically, thougli 
a precise formulation of the geometrical argument requires 
a little attention to detail. The pairs of numbers (cos^r, sinfr), 
r = 0, 1, • • •, m — 1, are the coordinates of the vertices of 
a regular polygon of m sides having its center at the origin, 
and the quantities (l/nO^cos/r, (1/wO^sinfr, being the 
coordinates of the center of gravity of these points, must 
evidently be zero. The more general fact of the vanishing 
of ^ cosA-fr and ^ sinfe/r is then obtained by consideration 
of the various possibilities as to the existence of common 
factors of k and w. 

In the following statements, let i) and q be integers subject 
to the restrictions 0<p<iw, 0 < q ^ . Then, in the 

identities 

cos^jj* cosrj'j* [cos(p — + cos (p 4 - 7 ).r], 

sinpj; siiif/.i — 2 -[cos 0 > — q).r — cos (;> + g)a;l, 

^inqx ~ ] [sin(p + g)a — sin (j; — q)x\f 

p — g and p + g are integers numerically less than m , except 
that p-\-q— m if mi is even and p — q ~~ so that 


2 cos P tr cos qtr - 0 
2 sinp<r sin qtr = 0 
^ cos ptr sin qtr = 0 

^ cos* otr -= y sin* ntr = - wj 

^ cos* pir ~ ni, ^sin*i>tr = " 0 


if P + 3. 

if p + q. 

for all p and q, 

if O '.p-C I m, 
if - 0 or 2 ”*• 


Until the contrary is stated, let it be supposed now that 
m is odd, m = . The equations for the interpolating 

coefficients given in the second paragraph of the chapter 
can be solved explicitly, under the present hypothesis of 
equal spacing of the points U-. It is known in advance that 
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they have one and just one solution. If the 2n+l equations 
are added as they stand, the left-hand member in the result 
reduces to (2w + l)«o, so that 

(As to the convention with regard to the meaning of the 
it is to be understood here that the summation is 
performed specifically over the range r = 0, or 

else that yr is defined outside this range as a periodic 
function of the index by the prescription thatyr-i 2 n+i == yr-) 
To determine aic, k>Q, let the (r+l)th equation be mul- 
tiplied by cos frfr, r = 0, 1, •••, 2n. On addition of the 
equations thus obtained, the left-hand member in the sum 
becomes ajc^ — imak— i{2n + l)aky whence 

ak — '2n + 1 ^ 

Similarly, 

■“ ^-1- i' ^ 

If the notation is modified to the extent of representing the 
constant term in the interpolating sum by rto/2, instead of Oo, 
the general formula for ak gives the correct value of a. also. 
It will be understood henceforth that the notation is adjusted 
in this way, and the interpolating sum for equally spaced 
points tr will be denoted by >S„ (x), so that 

Sn (x) = 2 "f" ^ ■ ■ ■ "I" 

+ bi sin x-\- . • • + 6» sin nr. 

the a’s and V& being given by the two preceding equations 
for A: =- 0, 1, • • •, n. The resemblance to the Fourier coeffici- 
ents is apparent. 

Even if it had not been known a priori that the equations 
have a unique solution, that fact would be an immediate 
consequence of the work of the last paragraph. For the 


»• 
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work shows that if the equations have a solution, it must be 
the one indicated; that is, they can not have more than one 
solution for any given set of t/’s. But if the determinant 
of the equations were zero, there would be values of the y's 
for which there would be infinitely many solutions. This new 
proof of the non-vanishing of the determinant, however, 
depends essentially on the assumption that the fs are equi- 
distant, and so is less general than the one previously given. 

The formal resemblance to the case of Fourier series is 
further borne out by substitution of the values of the a’s 
and 6’s in the expression defining Snix)^ and rearrangement 
of the result by means of the identity for a sum of cosines; 


2 r 1 ^ 

i^n{x) ~ -- — ^ ^inktrsmkx) 

I Z k-\ 


2n + 
2 

2n + 
1 


2n + 


■f [ 2 + COS fe (^r — a)| 

sin(n-|-0(<r — a;) 

YZy’-- 


sia i(<r— O’) 


Incidentally, the correctness of the last expression as a solution 
of the problem of interpolation can be verified directly, by 
substituting x = tqj for any particular value of q. For 
-^)7r/(2w + l), sin(ii-fi)(^r ”/g) = sin(?* -g)7r = 0, 
and all the terms of the summation vanish except the single 
one with a vanishing denominator, while the limiting value 
of the quotient of sines in that term is 2n+l? so that 

= yq» 

Now let the problem be changed by supposing that m is 
even, m = 2n. Corresponding to the 2n abscissas i, 

let 271 numbers yo, • • *, y 2 n~i be given, and let yr be defined 
for other values of r so that yr+ 2 n “ yr. Consider the 
question of the existence of a trigonometric sum Tn{x), of 
the wth order, represented ior the moment by the notation 
of the second paragraph of the chapter, to satisfy the con- 
ditions Tn{tr) = yr, r = 0, 1, • • - , 2n—\. It is not to be 
expected that the problem will have a definite solution, since 
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in 


only 2n conditions are imposed on the 2n+l coefticients. 
The precise nature of the indeterminacy is brouglit out by 
going through the formal manipulation of the equations. 
Written out at length, these are as follows: 

flo + «! cos to + hi sin to H 1- cos n f„ + sin n to 

■ yo, 

rto + fli cos <1 + hi sin ti -1 1- «„ cos m <i + h« sin n 

= 2/1 > 


«o + «i cos /i„_i + hi sin Un - 1 + 1- cos n hn-\ + K sin n fcn-i 


Direct addition of them gives 


— 2/2n-l. 


'2nao — 2yr, «o = i^yr)l2n. 


Multiplied respectively by cos ft to > cos ft/,, •• •, cosftt 2 «-i, for 
any value of ft from 1 to n — 1 inclusive, and then added, 
they yield 

tik = H/n)2yrCOskfr, 

and similarly 

hi; — {I I n)^!/r sin ft 

So far there is no ambiguity, and no formal difference from 
the case previously treated, except the replacement of 2m+1 
by 2m. When the (r4-l)th equation is multiplied by cosn/,-, 
there is a difference because in the present case n = 
and ^cos’ntr is equal to m = 2n, instead of ^m. The 
deteimination is still perfectly definite, however: 

COS«/r)/(2«). 


But as tr - rn/ti, the numbers sin n/r are all zero, and the 
use of these quantities as multipliers does not lead to any 
deteimination of h» at all. 

It is now apparent how the problem of interpolation with 
an even number of equidistant abscissas can be formulated 
so as to have a determinate solution. If denotes an 
expression of the form 
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- 00 + cosa^H h On -1 cos(n — l)x+ ^ nx 

+ 6i sin a; + j- hn-i sin (n — l)a:, 

the conditions SniU) — yr are expressed by 2n linear equations 
for the 2n coefficients. The solution of these equations, if 
they have a solution, is given uniquely by the formulas 

Ofc — (l/n)^^r cosit^r, hk ~ {I / u) ^ yr sijx ktrj 

for A: = 0, 1, • • •, n. But if the determinant of the system 
were zero, there would be values of the y’s for which more 
than one solution would exist. So the value of the determinant 
is certainly different from zero, and the problem has the 
unique solution indicated. As sin n/r = 0 for all values 
of r, the expression 

Sn{x) + hn sinna’, 

with any value of bny likewise takes on the value yr when 
X — tr\ the indeterminacy of the problem as first proposed 
consists merely in the complete indeterminacy of the coef- 
ficient hn* The subsequent discussion will relate to the sum 
Sn{x) as defined at the beginning of this paragraph, with- 
out the term in sin nx. 

For deriving a concise expression for Sn{x)y to be sure, 
it is convenient as a matter of form to include a term 
^trisinna:, the coefficient hn being defined by the general 
formula for hjc, with k = n\ as each term of the summation 
vanishes, this gives hn = 0 automatically. Then it appears that 

I r 1 ^ ^ 1 1 

Sn{x) = ^ COS A:(fr — ^ COS x) , 

n I ^ k i ^ J 

or, as 

~ +COSU+ ••• +cos(n — l)w+ 2 cosna 
: 2 2 

by subtraction of \ cos nii from the identity previously used, 

“ .y ^yr sin w — x) cot } (tr — x ) . 
n u 


Sn {x) 
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As in the case of an odd number of points, substitution of 
x= tg gives a direct verification of the validity of the formula 
for purposes of interpolation. The analogy with the partial 
sum of a Fourier series, liowever, is not superficially so much 
in evidence as before. 

2. Convergence and degree of convergence 
under hypotheses of continuity over entire period 

For the discussion of the analytical properties of the 
interpolating sums Sn{x)y particularly questions of convergence 
as n becomes infinite, let the given numbers ijr be values 
of a function having specified properties: let f{x) be a given 
function of period 2n, and let Then Sn{x) 

coincides in value with f{x) at the points U, and the question 
is to what extent (./•) furnishes an approximation to f{x) 
at intermediate points. 

In the first place, let /(a) be a bounded function having M 
as an upper bound for its absolute value. Then a corre- 
sponding upper bound can be assigned for \Sn{x) . Let 
m — 2n + I. If a: has one of the values U, i Sn (x) 1 = l/(^) i 
< M, If X is not one of the numbers tr. let in be that one 
of these numbers which is nearest to x. or one of the two 
nearest, if two are equally near. Then 


tu — 


71 

2 4 " 1 


^ a; < tu-\- 


7T 

271+1 ' 


Because of the periodicity of the functions concerned, as has 
been noted already, the summation with regard to r in the 
formula representing Sn{x) can be extended over any 2w + l 
successive values of r, and in particular from R — 7 i to R+ n 
inclusive. Accordingly 


Sn{x) 


I + f(i\ sip(w4-^) (^r— a:) I 
2«4-1 \rJk-n sini(<r— I 

< ^ sin (m -f (<, — x) j 

= 2n+l r^R-n ‘ sini(i'r— x) i’ 
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The absolute value of the last expression in bars never 
exceeds 2?t -K iis may be seen from its interpretation as a sum 
of cosines. This observation will be sufficient as far as the 
three terms corresponding to the indices r — R — l,i?, iZ+l 
are concerned. The numbers — x, — x, •••, — .r 

are respectively greater than 2;r/(2n+l), 4;r/(2M + l), ••• 
• ••, 2(n — l)7r/(2n4‘ 1); the same thing may be said of the 
sequence of numbers /«_•> — r , tu-i — x!, •••, | <K_n — x\; 
and for all the values of r in question, | U — rl < n, so that 
j sin ^ (tr — r) I > (1 /tt") I — .r [ . Hence 


I sin ! 

r‘^-n' sinl(tr— a") i 

<3«„ + .,+ (2 +2 

~ \> r H hll \ Sin i (tr— J') 

. K-2 R^n . . 

< 3(2« + l)4- Z + Z - .-tvT M 

\rs=H~-n r = R-\-2l | sio ^ (^r — x)\ 

. ie-2 V H 

< 3(2« + n + n- z Z -r-^vi 

) R''l! tr — 


X) 


< 3(2n4 1)4(2 H + 1) ’X ^ = 4(2»<41)4(2n4l)!Z ’ 

j=i j . - - 


)i— 1 


j 1 


/'n-l A„ 

< 4(2n4l)4(2n41)j, ‘i(2«4ll4(2n4l)log(j/ — 1 ) 

< (2 n + 1) (4 4 log w), 
and 

< il/(441ogn). 


The last parenthesis does not exceed a constant multiide of 
log TO, for «>2, and <S'„(.r)| does not exceed a constant 
multiple of M log n . 

Similar reasoning, with minor differences of detail, applies 
when m = 2w, the sum *9»(a;) then being given by the formula 
of the third paragraph preceding. The conclusion may be 
stated comprehensively for both cases as 

Lemma I. If f(x) is afuncUon of period 2 n satisfying the 
condition that 


\nx)\ < M 
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for all values of x, and if Sn(x) is the interpolating sum of 
the nth order for fix) corresponding to the subdivision of 
a period either into 2 m + 1 or into 2 n equal parts, then 

< CMlo^n 

for n>l, where (J is an absolute constant 

The representation of the constant by the notation used 
in the lemma preceding Theorem IX of Chapter I does not 
imply that the constants are the same, though of course both 
lemmas could be stated together, with a single symbol to 
represent the larger of the two constants. 

For the application of the lemma, it is to be noted that 
|/( 2 r) — -S„(a-) 1 M-\-CM logn, which is likewise not greater 
than a constant multiple of ilf logn, say i^J/log n ; that the 
interpolating expression corresponding to the sum of two given 
functions is the sum of the interpolating expressions con- 
structed for the two functions separately, and the error of 
the sum is the sum of the errors ; and that if m = 2 n -f 1 
the interpolating sum Sn («) formed for a function f(r) which 
is itself a trigonometric sum Tn (x) of the wth order is iden- 
tical with I'n W, since by reason of the interpolating property 
S„(x) and Tn{x) are trigonometric sums of the nth order 
coinciding in value at 2 n -f 1 distinct points of a period. 
Taken together with the lemma, these observations 3 rield 
Theorem I. If f{x) is a function of period 2 n. if S«(x) 
is the interpolating sum of the nth order for f{.r) corresponding 
to the subdiiisioti of a period into 2 n-f 1 equal parts, n>l, 
and if there exists a trigonometric sum Tn (x), of the nth order, 
stich that 

\f{x)-Tn{x)\ ^ *„ 

for all values of x, then, for all values of x, 

|/(a;)-— ^ J?«„logn, 

where B is an absolute constant. 

When m = 2n, the interpolating sum of the nth order, 
as defined above, for a trigonometric sum Tn{x) of the nth order 
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is not in general the same as Tni^x), since the intei'polating 
sum does not contain any term in sin nx. But the inter- 
polating sum /SnfiC'*') order n-f 1, obtained by taking 
m =■ 2»j-f 2, is identical with T„{x). For the coefficient 
fln-ri in given by the expression 

[1 /(n + 1)1 ^ cos (n -f 1) tr, 

reduces to zero, since each of the sines and cosines in 
through siiiwr and cos WcC, is orthogonal to cos(w4-1)^p for 
summation over the finite range in question; and it follows 
that Sn+if r) and 7nU*) are trigonometric sums of the nth order 
agreeing in value at 2 n + 2 distinct points of a period. The 
above statement may therefore be supplemented as follows: 

Theorem 1 {continued). If <Sn 4 -i W the interpolating mm 
of order ?i + 1 correspondhig to the subdivision of a period 
into 2n + 2 eqwd parts, the hgpntheses remaining otherwise 
muhanged, then 

(x) < Btn log (n 4- 1) . 

ahere B has thv same value as before. 

As log + 1) < 2 log n for w > I , the right-hand member 
may be replaced by 2 Be,, logu. As an alternative, n may 
be replaced by u — 1 , to give 


:/(•»•) — < Bfn~i\ogn 

for m --- 2w. 

Like Theorem IX of Chapter I. the present Theorem 1 can 
be combined with Theorems I -IV of Chapter I to give a suc- 
cession of more specific results. The cases of an odd number 
and of an even number of subdivisions can be covered by 
a single formulation each time, by virtue of the observation that 
l/(n — 1) ^ 2/« and w[27t/(n — 1)] ^ w(47r/n) ^ 2w(27r/n), 
for « > 1 . The symbol A is merely a notation for the largest 
of what would be obtained in the first instance as a finite 
number of different constants, and Ap similarly; 

Corollary I. If 

fixt) —f(x,) ^ A I a;* — a;, , 
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/or all values of .r, and X being a amsfant, then 
:f{x)-Sn{x)\ < 

for m — 2 n + 1 anrf for m = 2n. 

Corollary II. If f{x) is vontinuom mth modulus of von- 
tinnity w(^), 

f{x) — <S» {x) < -1 w I I log n 

for = 2?^+ 1 and for m = 2n. 

Corollary Ila. The interj)olating sttm SnU) vonveryex 
uniformly to the vahie f{x), as m becomes infinite through 
odd or even values {or both), if f{x) has a modulus of con- 
tinuity w(^) such that («)(3) logrf 0. 

Corollary III. If f{x) has a pth deriratire such 

that 


for all values of Xi and A being a constaJit, then 


f{x) - Sn{x) 


Ap?. log n 

■ > 4T ■ 


for m = 2u+ 1 and for m ~ 2n, 

Corollary TV. If f{x^ has everywhere a rontmuous 
pth derivative with modulus of continuity w(di). 



for m — 271 and for m = 2n, 

hi each of those statements, the conclusion holds for all 
values of j\ and for all values of n ^ 2; the coeftiaent A is 
an absolute constant, while Ap depends only on p. 


3. Convergence under hypothesis of continuity 
over part of a period 

The neit proposition is analogous to a result obtained in 
the second and third paragraphs of Chapter II: 
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Lemma 11. If /{x) is a function of yet'iod 2n, hounded 
and integrahle in the sense of Riemann, 

lim(l/m) ^f(tr) cosn^r = lim (1 /w) sinner 0. 


as m becomes infinite throvujh odd values {m =- 2n-\-\) or 
even values (rn — 2 m) or both. 

There is a noteworthy difPereiice in the hypothesis imposed 
on /(x); it is in the nature of the case no longer sufficient 
to assume merely that the function is summable, or sumraable 
with its square, since an enumerable set of very large values 
of fix), while not affecting the definite integrals of tht‘ 
earlier chapter, might throw the present sums entirely out of 
proportion. 

Suppose first that m — 2 m 1 . Let the summation be 
extended for definiteness from r — 0 to r ^ 2n. Since 
fr = 2rn/{2n-\-l)y 


2rn7i 2r(w + ^)/r — rn _ 

2 m - f"! 2M-j-l 2 m-|~1 


COSMfr = ( — 1)^ cos 


rn 

2n + l ^ 


1 


2n 


2m -f- 1 r = 0 


2 fiV) cosri^ 


2m + 1 


Ji_ y fi \ 

271 j—Q 2 m '-|“1 \ 2 m -|- 1 / 


(2; — l);r 
2n-}- 1 


When n becomes infinite, the first sum in the last member 
approaches 



COSM du, 


by the definition of integrability, and the second sum has 
the same limit, so that the whole expression approaches zero. 
(Neither sum as it stands is exactly of the form that would 
ordinarily be written down in defining the definite integinl, 
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but the discrepancy in each case amounts to half of a single 
terra, and approaches zero in the limit.) As 

sinjifr = (— 1)’'+^ sin j , 

•similar reasoning applies t(> the expression 
[1 / (2w + 1)] ^/(tr) sin w/r. 

if m = 2n, the summation going from r ---- 0 to r = 2n — 1 . 
ir -- rrt/n. cosnfr ~ cosj-.t ^ ( — I)’’. 
---^/(/,)cos ntr 

1 II r 

Each sum in the last member approaches 

and the whole expression approarhes zero. As for the other 
j'xpression in the statement of the lemma, sin nfr=^0 throughout. 

Suppose /(,/•) is a function of period 2 -t. bounded and 
integrable in the sense of Riemann, which vanishes for 
'0 + '? • If will be seen that IS,, Uq) coniergtx 
to the value 0. whether the values of m entering into the 
definition of the sums are even or odd. In both cases. 

.s’„(x) can be represented, artificially but with readily verifiable 
accuracy, by the single formula 

^ sin n(fr — .r) cot ^ (/, — r) 

+ 2 m ~ I^JVr) cos « Ur -- .C). 

As cos nUr — a;) = cos nx cos n tr 4- ^in nx sin n tr, the last 
sum, taken with the factor J/(2»n), approaches zero uniformly 
for all values of j-. And as /(f) cot Hf — •'«) is bounded 
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and integrable in the sense of Riemann as a function of ^ 
the first factor vanishing where the second factor is large, 
application of Lemma II to this function shows that the first 
part of the expression for Sn{x) approaches zero for x ro. 
If two functions, each bounded and integrable in the sense 
of Riemann, are identical from iCb — to Xo~\-rj, the difference 
of their interpolating expressions converges toward zero at Xo ; 
if f{x) is any function of period 2 n, hounded and mteyrahJe 
in the sense of Rienxann, the convergence of the corresponding 
interpolating sums Sn (x) at any specified point, as m becomes 
infinite through odd or even values, depends only on the 
behavior of f{x) in the neighborhood of the point in question. 

Now let f(x) be of period 27t, bounded and integrable in the 
sense of Riemann, and identically zero for a — rj < x < 
and let attention be directed to the problem of showing that 
8n{x) converges toward zero uniformly for a ^ x ^ It 
was noted in the last paragraph that Sn{x) either is given 
identically by the expression 

^ 2f(ir) sin n (U — r) cot (U — x ) , 

17% u 

or differs from it by an amount which approaches zero uniformly 
for all values of as n becomes infinite. Furthermore, the 
terms resulting from the expansion of sin n{tr — x) may be 
considered separately, and as the factors sinna; and cosnx 
are bounded, it is sufficient to demonstrate the uniform con- 
vergence of the expressions 

r) cos 71 tr COt ^ {U X) , 

m /u 

^ 2'/(^r) sin n tr cot It (U x) , 

m 2 

as X in the argument of the cotangent ranges over the inter- 
val (a, >J). 

Suppose in the first place that m 2n+l. Let x be 
restricted to the interval Let C'(x, t) be a function 

which vanishes when | ^ — x\ differs irom an integral multiple 

of 27r by less than rj, and is equal to cot — x) elsewhere. 
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Then f(t) C(xj t) is identical with f{t) coti(^ — x), under the 
restriction imposed on x, while C(x, t) is bounded for all 
values of x and never exceeding cot^iy in absolute value, 
and is iS-integrable as a function of U By adaptation of 
the formulas of an earlier paragraph, 

[27r/(2r2 + f)] cos n irC(x^ U) 

is equal to 

y fL^. '1- \ cos - — r> [r -4^-1 

2?t + l‘' \2n+l/ 2n + l ^\ ’2n + \! 

minus another expression of similar form. Let uj ^ 2^' tt / (2?i + 1 ) , 
y = 0, 1 , • • n, and let Auj 27r/(2n'f 1) for j = 0, 1, •••, 
n — 1, while Aun = 7r/(2n+l)* Then the sum just written 
down differs merely by the quantity 

71 ‘^nn \ 27X71 . / AfiTT \ 

2n-f 1 ^\2w + l ) + l \ ’ 2n+l)’ 


which approaches zero uniformly for all values of x, from 

H 

^ JViXj) cos nj C’ix, 2xij) A uj, 


a sum corresponding to a subdivision of the interval (0, tt) 
into n-\-l parts, not all of equal length, and defining the 


definite integral 



cos u r(x. 2u) dll 


in the limit. 


The essential point for the present argument is that the 
sum differs from the integi'al by an amount which approaches 
zero uniformly with respect to x. Let Mj and mj be the 
least upper bound and the greatest lower bound of the 
integrand over the Jth sub-interval, Fj and pj the corresponding 
bounds for the factor /(2h)cos?(, and Qj and qj those for 
the factor C{x, 2u). The difference between the sum and 
the integral does not exceed 



mj) A Hj, 



128 


THE THEORY OF APPROXIMATION 


Let M be the least upper bound of |/(2?t)|, as « varies 
without restriction, an upper bound consequently for i/(2 n) cos u { , 
and Q.(r= cot J i/) the least upper bound of ! 0(x, 2h) Then 

Mj-mj < M((^-gj) + W^j-pj), 

since for any «, and Us of the sub-interval in question 

f{2ut) cos «s t'{x, 2f<g) — /{.2iti) cos C(x, 2mi) 

— /( 2 hs) cos Hi (C(a:, 2 «g) — C(x, 2 mi)] 

-1- C'(£r , 2 «i ) [/(2 «*) cos Wj — /(2 i<i) cos «i] . 

As /(«) is by hypothesis integi’able in the sense of Hieniann, 
/( 2 m) cos H is 2?-integi-able also. Let t be an arbitrary 
positive quantity. By the property of integrability (more 
specifically, by Darboux’s theorem) it is possible to choose n 
so large, and the intervals dnj in consequence uniformly so 
small, that 

J-O J » 

On the other hand, 

n 

2 (Qj — f/j) 

can not exceed the total variation of C{x, 2 it) as n ranges 
over the interval (0, x), a total variation which is finite atuf 
indeptyid&nt of x, being equal to 4(,^. Hence if m is taken 
so large that 27r/(2M-l- 1)< f/(yyM), it follows that 

^ . 1 / (fij • qj) ^uj ■ . ' . 

j a * 

So there exists an A, independent of x, such that for n > N 

n 

The other sum analogous to the one just treated is 


= 1 


2^ ./2(2;— l)7r\ 

2n-l-P\* 2w-fi / 


(2; — l)7r 


2 (2 j— 1)71 

2w -f i 


)• 
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By addition of the term 


-T .I'2('2h — 1)'^ 

+ 2m + 1”'/ 


cos 


(2m — 1)-T 
2m + 1 ' 


2 (2 w — 1 ) /r \ 

2»r-fi /’ 


which approaches zero uiiiforml}', this becomes a sum ap- 
proximating the same detinite integral as before, and corre- 
sponding to a subdivision of the interval of integi-ation into 
71 — 1 parts each of length 27r/(2n + lj and one part of 
length 3. T/(2?i + !)• The difference between the sum and 
the integral approaches zero uniformly, and the difference 
between the two sums does the same, which means that 

2mVi 2 

converges uniformly toward zero. The (Muresponding ex- 
pression with sinu^ in place of cos ntr can be treated in 
tlie same way. This completes the proof of uniform con- 
vergence of S„(j') for odd values of m. 

For m - 2 n, the discussion of the sum containing cos ntr 
is slightly simplitied in form by the fact that cos u fr reduces 
to ( — 1)^ and by the fact that is an exact submultiple 

of but otherwise follows essentially the same lines as 
before. The sum with sin ii ft vanishes identically for all 
values ot u, since sin // fr 

The conclusion is embodied in 

IjKMMA 111. /s fmirtiini of ihoiotl 2.t trhtch is 

hounded nnd i7itej/rnhti> in fho sense tf htiemtom. and irhivli 
rnnishes identiealhj for « — tj < ./• the vor resi>ondni(j 

interitolatiiiff sums S,t (./) vonrenje nmfortnt/j toward zero for 
u jr < /i, as m tteromes infnite fhroar/h odd or ece)t ca/aes, 
or both. 

On the formal side it may be pointed out in i)assing, and 
might have been noted before, that for ni even S„ {.r) has 
the alternative representation 

— sin n r^f{fr) cos 71 /,• cot (/,• — ./*) . 


0 
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though the appearance of sinnj; as a factor of the whole 
expression is illusory to the extent that coiresponding to 
each value of x for which sinna; = 0 there is a factor 
cot^(fr — x) which becomes infinite. 

Lemma III may be combined at once with Corollary 11a 
of Theorem I to give 

Theorem 11. If fix) is a function of period 2 a, hounded 
and integrahle in the. sense of Riemann over a period-, mid 
continuous for « — r <. d-\- 1 / with n modulus of eou- 
tinuity w(d) such that lim(^:=ow(d) logd — 0, the correspondiuy 
interpoiatiny sums Sn(x) converge uniformly toward f(u) for 
a ^x-^ fi, as m becomes infinite, through odd or even values, 
or both. 

4. Convergence under hypothesis of limited variation 

There are theorems of convergence for functions of limited 
variation, analogous to those obtained in the case of Fourier 
series. Let /(j) be a function of period 2/r, with limited 
variation over a period, and let .v have a value, to be regarded 
for the time being as fixed, such that f(f) is continuous for 
t = X. It will be shown that <S'„(r) converges toward f(x) 
for the value of x specified. 

Let fjPi (/) and y* (<) be the positive and negative variations 
of /(<). measured from the point t =- — 2n. say. so that 

/(<) == /(— 2 + 9>i (0 — Vi (0, 

while Vx iind y* are monotone increasing, and continuous for 
t = x. (The point x may of course be thought of without 
loss of generality as belonging to the interval (0, 'in), and 
then there will be no occasion to take account of values of 
t less than — 2ar.) Consider the case w/ -— 2«+l. If x 
is one of the numbers U, for a specified value of n, Unix) -/(./ ) 
exactly. Otherwise, let Ir be the number which is nearest 
to X (or one of the two such numbers, if two are equally 
near), so that 



IV. TRIGONOMETRIC INTERPOLATION 


131 


Let the summation over r in the formula representing 
be extended from — n R-\-n. The corresponding formula 
with f{tr) replaced by 1 represents 1 identically, since 1 is 
a trigonometric sum of order zero. If this identity is multi- 
plied by the quantity f{x\ independent of r, and the result sub- 
tracted from the formula for Sn{x), the difference 
is obtained in the form 


^n{x) —fix) = 


I R^n 

YnTlr^-n 


sin(n+^K^r — x) 
sin \ iU — x) 


l^he right-hand member is equal to the difference between 


I K-fn 

2 ^ XT -2/ [yi i^r) 
in-t' ^ r—R-n 


y, (x)] 


si n (n-f i) {tr — x) 
sin i {tr — x) 


and the similar expression with 9 t{tr) — 9 i{x) in place of 

'/’l (<r) — 9i i^)- 

The quotient of sines never exceeds 2 w + 1 in absolute 
value. In the term corresponding to the index r — B, fjt 
approaches x as n becomes infinite, and approaches 

9 i{x), by the continuity of g>i, and the whole term, multi- 
plied by 1/(2 n -f- 1), approaches zero and need not be further 
taken into account; it is sufficient to consider the sum from 
if -f 1 to if + w and the corresponding sum from B — n 
to if — 1 . 

Since tn+j — x ^ (r — x 4- [2j 7 t /(2 n + 1)] , 


sin 



— x) = sin 




= (— ly sin|n-i- g-j (tR — x), 


and is independent of j, except as to algebraic sign. If 
i is denoted by uj, and if c stands for 0 or 1 

according as sin (n + i)(^i 2 — a*) is positive or negative, 


1 


R-\-n 

2n-fl 


sin (w-4- \){tr — x) 
sin ^ Ur— x) 


^ {-lyZi-WAjBj, 

9 * 



132 


THE THEORY OF APPROXIMATION 


where 

Aj — V', U- + 2 »;) — SPi 


3iii(n+ — ac) I 
(2n-f I)sinM; 


The numbers 4;* 5/ are all positive or zero. As j increases, 
A; increases or remains unchanged; is always between 0 
and nl2, sin Uj increases, and Bj decreases. Let V be the 
total variation at fix) over an interval of length 2 tt . Then 
the A’s have the upper bound .J V, since the positive and 
negative variations of a periodic function over a period are 
each equal to half the total variation; while Bj, which is 
equal to the absolute value of 


^ sin (n_-t- .V) ( tu^ j — ./•) 

(2w -h 1) sini(f«4j— r) ’ 
can not exceed unity. 

After the analogj’ of the proof in the case of Fourier series, 
where the second law of the mean was employed, it would 
be natural here to use the method of summation by parts. 
The formulation can be slightly simplified, however, by reason 
of the fact that the sequences Aj, Bj are hath monotone, 
though varying in opposite senses. In general, let a, , • • •, np. 
hi, ..., hp be two sets of p numbers each, satisfying the 
conditions 

rti ^ = ■ • • ^ ^ 0 ■ h, ■> /;, hp > 0 . 

Let fj ~ hi — hj, so that 


0 = c, < ... < <■„ < hi . 

Then 

— ^ ( — IV ttj hp—j-^i - - n, hp — bp~i -j- . . . 

J 1 

= «i 1^1 — — Hi {hi — (>-i) H 

— hi {(It — <l» ■{■ • • •) — (^*1 Cp — (>a Cp- 1 . . .). 


Since both the a's and the c’s are non-uegativc and decrease 
monotonically, the value of each parenthesis in the last 
expression is positive or zero, and not greater than its leading 
term, in one case ai , and in the other case a, cp, which in turn 
is not gi-eater than o, hi. So the whole expression is the 
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difference of two non-negative quantities, neither of which 
can exceed Ot bi , and is itself not greater than at h in ab- 
solute value. To emphasize by words rather than by subscripts 
what is essential in the conclusion, thr nhsoliiU value of 

( 1 V aj hp—jj. 1 

j « 

does not exceed the pi'oduct of the largest of the a's hg the 
largest of the h’s. 

Let e be an arbitrary i)ositive (|uantity. Con'esponding to 
the continuity of y>i , let d >0 be chosen so that j 5 Pi(t) — 9 t.{x)\ 
^ for |t- r < d. Let the sum — iVAjBjot the 
second paragraph preceding be broken up into two, 2' and 
in the first of which J ranges over the values for which 
2 < d, while the remaining terms make up the second. 

In 1-1*6 largest value of Aj is not greater than i « , while 
Bj never exceeds 1, so that by the conclusion of the last para- 
graph, with suitable adaptation of the subscripts, - S ^ • 
In 2''- Aj< i F throughout, while < I / [(2 « -f 1) sin ^ d] ; 
consequently 

■2!" < I']/[2(2?j-l-l)sin | dj, 

which is less than i t as soon as « is sufficiently large. 
Hence or, in the earlier notation, the sum from 

ii+l to B + n, multiplied by l/(2n4-l). approaches zero 
as M becomes infinite. The sum from B — it to B — 1 and 
the corresponding sums with y* in place of y, can be treated 
in the same way, to show that S„ (x) — /(a) converges 
toward zero. 

If f(x) is continuous everywhere, the uniform continuity 
of yt and y*, bearing on the choice of d in the preceding 
paragraph, and, as a detail, on the convergence toward zero 
of the single tenn in each snm for which r - - B, yields at 
once uniform convergence of (a?) toward /(a:). 

Finally, a precisely similar argument can be carried through 
for the case of an even number of interpolating points, 
m — 2«. 
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The conclusion is 

Theorem III. If f{x) is a function of period 2 tv having 
limited variation over a period^ Sn{x) converges toward fix) 
at every point where fix) is continuous^ when m becomes in- 
finite through odd or even values^ or both] if fix), still sup- 
posed to be of limited variation, is continuous everywhere, the 
convergence is uniform for all values of x. 

This result can be generalized immediately by reference 
to Lemma III, and the more elementary fact that convergence 
at any point depends only on the behavior of the function in the 
neighborhood of the point: 

Corollary. If fix) is a functmi of period 2 tv, bounded 
and integrable in the sense of Riemann ovei' a period, of 
limited variation for xq — t} < x < + ct^^d continuous 

foi’ X = Xq, Snixi)) converges toward /(^o); if fi^) i^ cou- 
tinuous and of limited variation for a — V ^ ~ 

the hypothesis remaining otherwise unchanged, Snix) convergis 
toward fix) imiformly for o ^ x ^ jS, 

5. Degree of convergence under hypotheses 
involving limited variation 

Let fix) once more be of limited variation over a period, 
with total variation V. Let yiCr) be the positive variation 
of fix) from 0 to x when j;>0, and minus the positive 
variation from x to 0 when x<i), and let f^{x) be the 
correspondingly defined negative variation function, so that 

fix) = /(O) + fi (x) — n ix), (0) 9>2 (0) 0 . 

Let m — 2n-\-l, l 0, Consider the sum cos n/r, 

extended specifically from r — — n to r ^ n. The sum 
2/(0) cos n^r vanishes. In ^fiiU) cosnU, the term corre- 
sponding to r = 0 vanishes with g>i (0) . It has already been 
observed that cos n^r = ( — 1)^ cos [r/r/(2n + !)]• Ir the 


n 


r -- 1 


y 1 (^y*) COS nty y 


sum 
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the factor yiCfr) increases monotonically, having the upper 
bound (tt), while, as r 7r/(2 w -|- 1) remains within the limits 
(0, 7r/2), jcoswfrl decreases monotonically, having the upper 
bound 1. Consequently, by the reasoning of an earlier 
paragrapli. 




r 1 


(pi (fr) cos n/r 




Similarly, the absolute value of the sum from — w to — 1 
does not exceed \ <pi ( — n)'\. So the whole sum from — n 
to -f« does not exceed 

y I (^) + i 9>i (.— ?r) , yi O'O — y I (— tt) = ^ V. 

After analogous reasoning with y* in place of yi , it is con- 
cluded that 

|^/(fr)eOS«fr| ^ V. 


There is a con esponding inequality 

|^/(fr)sinn/r < V, 


the proof being simplified in this case by the fact that the 
monotone sequences yl(^.) , isiiuj^. vary in the 

same sense. 

For m~- 2 n a still simpler calculation, based on the tact 
that cos n U - - ( — 1 V'. shows that i^VXW <'os ntr\ ^ \\ while 
2:/(<*-)sinMfr 0. 

Disregarding the special simplicity of the last observation, 
it is possible to state comprehensively 

Lkmma IV. If fix) is (I function of period 27t, with Jimifvd 
variation over a periodj its total variation being V, then 

i^/(fr) cos wtr! ^ | ^/(^r) sin W ^ T", 

whether w is odd (m ~ 2w-f 1) or et^en (m = 2n). 

Let fix) be of limited variation over a period, with total 
variation V, and identically zero for a — ^ ^ ^ ^ 
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It has already been pointed out in substance th«Tt if Om is 
a symbol denoting 1 or 0 according as m is odd or even. 
iV,, (.r) satisfies the identity 

m S„ (a) cos n.r ^ fit,) cot {f, — a) sin ii fy 
- sin ./*(//) cot (fy — .r)cos>i/,. 

+ Cm <*OS H .V fy) cos U ty 1 ' Ota sill H ./' sill U ty 

For any value of ./• in («,/#). the product ./’(OcotA(/ — a) 
is of limited variation, regarded as a function of /, and its 
total variation does not exceed the product of Tby a (piantity 
depending only on So the application of Lemma IV to 
each of the four sums in turn leads to the 

(^OKOiaiAHY. If f{.r) is it fnnvf ion of jHoi Of I 2.T irifh hoiitcil 
rarintioUf ilte ioini rnriafion orrr it jirnoi/ hrini/ ttnil if 
fix) ritms/tos iiU^nfivitllif J or u fj • x + fftnt 

S,(r) ‘ (’^Vin 

for « * _ a' * /d, irJirrr i/i jnoiils otihj on ry. 

This Corollaiy in turn may lx* associatml with Corollary 11 
of Theorem I, in the light of the discussion leading up to 
Theorem Va in Chapter II, to yield 

Theohkm IV. If thr fnnrtion f(x). of pm Of! 2;r, ts ron- 
tinuoits iritli inoiliihts of ronfintt it tj oi{d) for « 
irlicrc w(d) for d (), anil of Hm if vd ritrntfnnt oror flu* 
H'sf of It jirrioif, fitrn 

f(.r) — ( /•) VO ('2x/n) log n 

for a X - if n is fitri/i* vnoitifli so fftitf w(2/f/x; fins 
It mmnintp r firintj it ronsfimf tr/tirlt ifrprnifs uvifhrr on x 
nor on n. 

I.emma IV is analogous to Theorem III of diapter JL tlH‘ 
sums in the present hmnna. multiplied by a quantity of tlu' 
order of 1/w to represent the length of the interval between 
successive i»oints U, coiTCsponding to the integrals in the 
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earlier formulation, though as the length of the intei*val is 
either exactly or approximately nhi, according as m is even 
or odd, the agreement does not extend to the numerical values 
obtained for the constants in the right-hand members of the 
ine(|ualities. It is sufficiently noteworthy, however, that the 
agreement should be as close at it is, since for any particular 
value of n the number of points used is not large enough 
to make it all evident a priori that the sums give close 
approximations to the values of the integrals, in view of the 
presence of the factors cos u./’, sin «,/• under the sign of 
iirtegiation. 

The analogy can be extended to the conclusions of Cor- 
ollary I of the theorem referred to. The proof is perhaps 
most readily given by means of the Corollary itself. Suppose 
f{x) has a pth derivative with limited variation. Then fix) 
is represented by a convergent Fourier series: if the Fourier 
coefficients are denoted by «*, dk, 

f (jO - •> + .2/ («fc cos l;j- dk sin A- j ). 

“ k- 1 

The series may be used to represent fitr) in evaluating the 
sums 2^ fit,) cos ntr, 2 fi^r) sin nU, Let nj = The 

expression ^ cos A tr cos n/,- vanishes unless k — n or A--i-n 
is . an integral multiple, of 2 w -f 1 • that is, unless A- has one 
of the values n, «-|-l. 3 7i-|-l, 3 774-2, 5 7i-f-2, .'>77-1-3. •••. 
The viilue of the sum in each of these cases is (2 77 4-11/2. 
The sum ^ sinfr/r cos >jfr is zei’o in all cases. Consequently 

^fity) C0S7tfr [(27l4”ll/2] («n+«H-^l4~C‘Hn^ l4'««n+24 1* 

Similarly, 

^./"(Msin ntr— [(277-|-l)/2](Ai — — Ain+24 •• 

Hut by the Corollary cited, 

lafc < 17(71 fcf+H, A- ; < F/(7TfcP+i), 
if V is the total variation of (o') . So 
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-f- 

7T [ 

-f- 

7t 


I «»+ «a/»+2 • • • ! 

-J_ . _J_ + L_ . + L— +...1 

^ («4-i)p ^ (3 «4-l)»’+‘ (3n+2)*’+i ^ I 


< JL I — t 1 _ 


‘(3n)Pi i (3 n)P^ 


iTnft-i ^ 3P+‘ ^ 


+ ... 


the series in the last pair of brackets being convergent for 
p>^l, and having incidentally, for ^ 1, the upper bound 

1 4- 1/3*4- 1/5* -1 . independent of p. A similar inequality 

holds for the series of /J's . If m --- 2 n . 


ZAtr) fos nfr= 2 n (,««4- «Sii4- -I >,• 


and 0. Whether m is odd or erm, there- 

fore, 

i2)/(^)coswt,.; < rr/wP, '^/(/,)sinw/r • ('yini\ 

ivlieir C is nn ahsolnto const ant. 

This result, supplemented by a type of argument whicli 
has been used a number of times already, gives at once 
certain information with regard to the degree of convergence 
of interpolating sums Sn{j'). 1^'or example, if f{:r) has a first 

derivative which is of limited variation over a period, and 
if /(.r) vanishes identically for a — tj < ./• ' ^ + then 
Sn{^) ean not exceed a constant multiple of l//r for (c 
if /W has a first derivative which is of limited variation 
over a period, and if fh) is continuous with modulus of 
continuity w(d) for « — // < x ^ but not identically 

constant over the interval, then !/(a:) — S,i{x) can not exceed 
a constant multiple of (1/w) w(27r/w) log w for « < .r < fi. 
But such observations are of secondary interest; and it is not 
possible to pass over immediately to a proposition analogous 
to Theorem IV of Chapter II, for the transition from a specified 
interpolating sum to one of higher order does not con- 
sist merely in the inclusion of additional terms directly subject 
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to the inequalities that have been under discussion. For the 
problem thus suggested a difterent procedure is required. Tin* 
detailed treatment will be limited to the next ease in order, 
that of a function having a first derivative of limited variation 
over a period for expressible as the integral of a function 
of limited variation), and not otherwise restric^ted. 

The key to the discussion of this case is the fact that if 
f{x) is the integral of a function of limited variation over 
an interval (f x />, the quotient [/(./•) — /(o)]/(./ — o) 
(defined in any way, by the value /(o -l-) or otherwise, tor 
X — ff) is likewise of limited variation over the interval. Let 

fU) ^ /(„) + r c; U) ./r, 

where -■ and f/i and iire bounded 

non-decreasing functions from ft to h. Let 

X ft 

for ft while and let a function (fK(f) 

be similarly defined in terms of v’sU')« Then, for ./’ ft. 


/(•'*) 

./• — ft 


(Pj (./ ) — Ipj (./■) , 


The assertion with regard to the quotient on tlie left will 
be proved if it is shown that CPi(a ) and are non- 

decreasing; the fact is rather obvious, and in fovmulating 
a proof it is cl(»arly sufficient to consider one of the two 
functions. For any value of ./• ft. 


(./ — 0) (fl) ‘ 

rJa 


(.c - 


by reason of tlu‘ monotone character of 1 , so that 
U) Vi (<0 I-et xi and x^ be any two values 

such that ft- .^Xi- Xi - I). Then 

— (.1*1 — ff) A', , 

( ^/'i (‘C) dx ~ ( ~t" f ~ ‘ * 

•JU t/n ft/.r*. 



140 


THE THEORY OF APPROXIMATION 


where 

So 


^ yiUi) ^ Xa < SPife)- 



fi{x)dx > {xx 


-a)Xx + {x,-Xx)Xx 


{x>i — a) .Yj . 


whence ^i{xt) ^ A\; as Xi the desired conclusion 

is established. Similarly. [f{x) — /(/>)]/(/> — x) is of limited 
variation. 

Now let f{x) be of period 2/r, and (for simplicity of 
statement) provided with a first derivative of limited variation 
over a period. The function (.r — r?) cot i (a; — a), defined 
so as to be continuous forx a, has a continuous derivative 
for a ^ a-\-7i ^ and so is of limited variation over this 
interval. Hence the product 

{[/(a')— /(«)]/U' — a)} {(a.-— a) cot * (a; — «)[ 

= [/(a)— /(«)] cot (x— «) 


is of limited variation over {a,a-\- >0, since this is true of each of 
the expressions in braces. Similarly, (x — a — 27t) cot ^(a: ~o) 
is of limited variation for a + ^ ^ ^ o -f 27r ; as f{a + 27v) ~~ f{a ) , 

the expression \jXx) —f{a)]/(a-\-27T — x) is the same as 
[f{x ) — /(a + 27r)]/(a -|- 2 :t — x)^ which is of limited variation 
over + a + 27r); and so [/(a*) — /(rt)]cot^tx — a) is of 
limited variation over this interval also. In summary, and in 
slightly differentnotation, the expression f{x)\ cot x), 
regarded as a function of /, is of limited variation over any 
interval of length 27r. 

Let the inteniolating sum for /(.r) be expi essed once more 
in the form used in connection with the Corollary of Lemma IV. 
The intei'polating sum for a constant reproduces the constant 
identically; since f{x) is a constant with respect to the index 
of summation r, the quantity given by the formula reduces 
identically to mf{x), if f(tr) is replaced by f{x) under the 
sign of summation, and 
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/(a-)] 

= cos nx 2^ l/(U) — /(a?)] cot ^ (/y — x) sin ?i tr 

— sin nx^ L/ (^r) — cot {U — x) cos n tr 

+ Oni cos n :r V \J'{Q — /U‘)l cos 71 tr 

+ e,n sin 71 X ^ [/(tr) —/(x)] sin W tr- 

Hut /(O “/(./•), as well as [f(t)—f(x)]coti(t — x), is oi 
limited variation with respec.t to /, and consequently Lemma IV 
is applicable to each ot the sums on the right. Closer 
inspection shows that the total variation in each case is not 
greater than a constant multiple of the total variation of 
/'(x) over a period. If this fact is incorporated in the 
statement of the result, the conclusion may be formulated as 
Theorem V. Jff(x) is n ftnivtum of period 2 tz having a first 
dericntivv trith lnnifvd rariafion, the total variatio 7 i of f (x) 
onx a prriod heiftg 1\ thfOi 

f(x)-Snix) -kCV/n. 

irhrrn C is aa absolate noistant. 

The corresponding analysis for the case of functions having 
liiglier derivatives will not be carried through here. It may 
be j)ointcd out, however, as a first stage in the extension, 
that if f(x) has a continuous non-decreasing second derivative 
for a x < />, then [./(.r) — / (u)] /(./’— ti), considered to have 
the value /'(o) for j' = a, has a non-deereasing first derivative 
over the same interval. For .r ff, 


/(-O— /(«) 

(lx j — a 



as may be seen by calculating the derivative from first 
principles and applying the extended mean value theorem in 
the process. For 

/ > 1 . ")/ ~ [/<_•*■) 

dxV X — (( J (x — o)* 

-- 2 /"(?) ^ 2 -^’"^“^’ 
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tlie mimber ? lying between a and a\ For a, furthermore. 
T/U)— /(a)1 ^ 

fijr^ [ Jf' — (I J C^’ — ^0^ 

. r(^-rm > 0 . 

./ — a 

The monotone character of the first derivative thus becomes 
apparent. It follows that if f{x) has a continuous second 
derivative of limited variation, [./'(^‘) — — ^0 hns 
a first derivative of limited variation, and connection can be 
made with the facts previously ascertained as to functions 
satisfying the latter condition. 


6. Formula of interpolation analogous 
to the Fej6r mean 

A considerable part of Chapter 11 was devoted to a dis- 
cussion of the arithmetic mean of the i)artial sums of the 
Fourier series. There is a corresponding formula in the case 
of interpolation, [possessing many analogous properties, with 
the outstanding exception that it is not an arithmetic mean 
of a sequence of the interpolating sums previously studied. 
It is to be defined and examined on its own merits, witli 
only incidental reference to the ctpiitent of the earlier [part of 
the present chapter. 

Let n be an arbitrary positive integer, and let fr 2r>r/n. 
for any integi-al value of / . In comparison with the earlier 
notation, m is now to be taken ecpial t(p n, instead (pf 27i r I 
or 2w; the sign 21 will be understood to refer to summation 
over H successive values of the index /•, when there is no 
indication to the contrary; a separate .symbol m is no longer 
needed; and there is no occasion to distinguish between odd 
and even values of w. 

I jet f{x) be an arbitrary function of period 27r. 'Phe intei- 
]) dating formula in question is 


= lA'r) 
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with thft understauding that each tern is defined so as to be 
continuous wherever its denominator vanishes. For ./• — l,„ 
one term of the sum reduces to f(tg), namely that in which 
/• - q, or in which r differs from q by an integral multiple of n, 
while each of the other terms becomes zero, and consequently 

So the interpolating property is apparent at the outset. 

It is an almost immediate* conseeiiience of identities previously 
employed that 

(sin* 2 «'■)/ (sin* ‘ <•) 

— — l)cos/ -4-(«— 2)cos2r-| 1-<-0 .s(m — l)rj. 

Hence, as a result of the substitution r f, — 

' <!> + (h cos 2 4- • . . -f- cos (n — 1).^ 

4- Ih sin .)• 4- ^>2 sin 2 ./• 4- • • • 4- kt i sin (?i — 1 ) ./• . 
with the coeffleients 

2(/< — 1{) . 

(fh {fr) COSA fr, 

tr 

fe_, 

The expression is a trigonoinetrie sum, but of order about 
twice as hi^h as would be required merely for the purpose 
of obtaining coincidence at n points. 

Jf f{.r) 1, all the coefficients reduce to zero except f/„. 

while ^(U) 1, so that in this case o',, Or) 1. or, it 

(sin*?«/0/(sin^u) is denoted by (»). 


sin* .1 >1 ((r — .'■) 
u*sin*.l(fr--.' ) 




1 . 


Hence it follows further, inasmuch as <&»(«) is never negative, 
that if /(.r) i.s any function having M as an upper bound 
for its absolute value. 
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[•fnU-) 


^ (M, n*) On ^ (/,. — y)J = 1 = M. 


Like the arithmetie mean associated witli the Fourier series, 
the present converges uniformly toward f{x), as 

n becomes infinite, if /(a-) is everywhere continuous, and 
converges at points of continuity under more.general hypotheses 
as to the behavior of the function elsewhere. Inasmuch as 
the sums depend on the values of the function at isolated 
points, however, it is necessary to impose some restriction 
on the values which it may take on point by point, not 
merely to require that it be summable, or in other words 
that it have a finite mean value. 

Let f(t) be continuous for t — .v. and let f(t) < M 
everywhere. By a device already used on a number of 
occasions, the error of tfn(.r) can be expressed in tlu; form 

(j) —/i f) «*) [/(fr) --/(•'')] [2 ~ 1 • 

Let e be an arbitrary positive (juantity. and let d be a positive 
number such that f(f) — f{x) for t-~.r < 6. Let the 

sum in the right-hand member of the identity for — /(a) 
be written in the form where denotes summation 

over those values of r for which — ./• differs from an integral 
multiple of 27t by less than d. and stands for a summation 
covering the remaining values of r. In 2^', /{(,) — fix ) ; < i * . 
so that 

^ 2;'(l/«*)[/ «r) -fix)] a)„[ ‘ (/,- a-)] ; 

< Z (1 /n‘) (*/2) On [ 2 ('r - ^)] = 2 ‘ • 

In^", — -c)] ^ l/(sin*id), and \f{lr)—/(x)\ < 2M, 

while the number of terms can not exceed n , and consequently 
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^"(1 hi *) [/(/,•)—/(./•)] ^ (/, -.'•)] ' il 2Ml{}i sin* dj . 

Avhicli is less than i t as soon as n is sufficiently large. This 
establishes the fact of convergence. It is sufficient that n 
surpass a bound de])ending only on M and d ; if f(x) is con- 
tiiuioiis everyAvhere. d can be chosen independently of j\, 
and the convergence is uniform. If f(jr) is continuous for 
H — ijf ./• ^ . without being necessarily continuous 

everywhere, it is possible to choose a 6 ^ which shall be 
valid for all values of x in the interval a < x ^ and 
the convergence is uniform over the latter interval. Finally, 
if /(./•), instead of btung merely' continuous for a — < x 

< /? + iy, is identically zero there, and if x is given a value 

belonging to the interval (a, fi), I/(sin*i^) 

in all terms in \yhich /(/,-) { 0, and 

^ siir 

Tlie results may be summarized in 

Thkoukm VI. ff /(J^) (( hmmdrd funviion of period 2 /t. 

^n{^) ionver<fen ioivnrd f{x) ai nery point (it which f(x) is 
continuoas, Jf /{x) is voniinuons rrcrywitvre. the converyeniv 
is uniform everywhere. If f{x) is continuous for u — i] ^ x 
!r.- conreryenrr is uniform for x ^ fi. If 

f{x) IS ideiiticuUy zero for u ~ x ' (^n(x) 

< Cy^Mhifor a X -- u'here M is un upper hound for 

ly'(r)', and C'jj is u. consfunt dependiny only on t,. 

Further discussion of the degree of convergence of On (./*) 
will be limited to the case in which /’(./) satisfies the condition 

fit'i) — /(-'i) > ‘i -- 'i • 

Suppose first that this condition is satisfied everywhere. Then 

(.r) -Au-) > (A «*) 2’ t. - [ J {fr - .'•)]. 

For any particular value of .>■, let tu be that one of the 
numbers tr which is nearest to j-, or one of the two nearest, 
if r is equally distant from two of them, so that tu — (-t/h) 
^ a; ^ ttf + i^/n). Let the summation be thought of as 


10 
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extended specilically over the n values ol* r for wliieh ./ — -7 
< + The precise expressions for the extreme values 

of r in terms of K and n will vary according to circumstances; 
they will be approximately and it is sufficient for 

the purposes of the present argument to note that they will 
certainly be between Jit n and h, for any )i ^ 
ue of i\ 

(H— l)cosr-f(H— 2)cos2/ f • • • I 


For any va 



and hence 


< 2 I ^ 71 -flu - l)-f (H-'2)-f • I j - 


Furthermore, 0 ^ /v'sinir < for 
tris interval, consequently 

^ / 1 \ _ r . sin i nr \ ; 

^ “ ( 2 / sin hr \ sin \ v ’ * ' 

and at the same time (for r f 0) 


sin 


H r 


Throughout 
V. . u • 1 . 


r 



1 

V . 


— -siir nr 
sin- i i* 2 



r 


These relations are to be used in connection with the 
inequality for j — /{ j ) in the second paragraph preceding. 

If ./■ is one of the numbers tr, - fU), luid there is 

no further question as to the magnitude of the error. If ./• 
does not coincide with a fr, theie are just two values of r 
in the .summation for which fr — ./* .'2tt/)i, In the terms 

of the sum corresponding to these values of r it is sufficient 
to apply the relation \r <Z>;,(4 r) /?rr, or (r — /• — ^*)] 

< 7171, For the other points fy, in the order of increasing 
distance from ./• on each side, the values of ,Jr — .r| are 
successively greater than the numbers 2n/n, 
and by the relation r 0,^(1 v) < the corresponding 

values of tr — x On[\(tr — x)] are respectively less than 
7i7x/2y nrtlA^ nn!^^ .... Hence, as the sum involves not 
more than n points U on each side of r, 
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2 I ir J' 1 2 j 

< 2 w /I + 2 [n 71 12 -\~7i7i/4-\-n tt/G + • • • ~f 7r/{2n)] 

.= «-[2 + 1 +.‘ + |- + -.+ ’] 

' n TT I = 71 IT (G 4“ log 7l) , 

which for 7i " 2 docs not exceed a constant multiple of 
n log n . This makes it i)Ossible to state 
Thkohkm VIT. If is (iftmrfioif of period 2 it safisft/fnf/ 
nrrt/irhoro fho (‘nn(mio7i 

''2 j 

t/fon, for i\U rahies of n ^ 2, 

f{.r) — (s„ (r) ‘ - {CX log 7i)in, 

tvhoro f' fs an ahsoluto nnisianf. 

The conclusion may be generalized at once by combination 
with the last assertion of Theorem VI, to yield the 

CoHOLLAHY. //V hoHudvd fimrfion of pt'viod 2 it 
satirfyiyifi tJu' rondition 

'^fW < ^*2 — *^1 

tlirnay/iouf the iyiforral a — .r * /? + // , fhr)i 

/W — <rn(r) j ' - {cX log n)/7i 

for « i_ irltero r is a constant dcpmdmg neithcy' on x 

nor on n. 

In conclusion, it may be pointed out as a peculiarity of 
the interpolating sum Onix) that its derivative vanishes at each 
of the points t,j (See L. Fejer, Gdttinger Xachrichten (1916), 
pp. 66-91, especially pp. 87 -91). For in the identity 

‘rn W -- Z (l/«‘) fitr) On [ ^ “ •>•)] 

it is evident from the expression of <Pn in fractional form, 
or can be verified by differentiating this expression, that 


10* 
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®»U(^ — ^O] has a double root for j- — tq if /,• is not con- 
gruent to iq modulo 2/r, while the representation of — a;)] 

as a sum of cosines shows that it also has a vanishing 
derivative for jr — iq. 

7. Polynomial interpolation 
As was indicated in the opening paragi-aph of the chapter, 
the methods that have been set forth are not adapted to the 
study of the problem of polynomial interpolation with equally 
spaced points. That problem is analogous rather to the theory 
of Taylor’s series, whether treated by means of Taylor's 
theorem with the remainder for real variables, or by (’auchy's 
theorem in the complex jtlane. A simple, change of variable, 
however, serves to carry over the fomulas of trigonometric 
interpolation to a case of polynomial interpolation with 
unequally spaced jaiints distributed in a certain way. Suppose 
namely that a function /(j) is defined for — 1 ./• < 1 . Then 

/ (cos 6) is a function defined for all values of n . It is an 
even function of d. and inspection of the formulas defining 
the coefficients in the interpolating sums 6 'h( 0) and shows 
at once that these sums involve only cosines, and so may 
be regarded as polynomials in cos H. If r - cos 0. .S’,, (fl) 
and a„(m may be denoted by P„U) and Uni.r) respectively. 
They are polynomials agreeing in value with f (./■) for a set 
of values of x corresponding to equally spaced values of f>. 
The polynomial .t,, (j ), unlike the interpolating polynomial of 
minimum degree for ecjually spaced values of x, converges 
in the case of every continuous function f(j)\ its degi’ee how- 
ever is approximately twice as great as the number of points 
for which coincidence is obtained. The expre.ssion Pn{x), on 
the other hand, is an interpolating polynomial of minimum 
degi’ee, and while it is not convergent for every continuous 
function, it converges far more generally than the correspond- 
ing polynomial with equally spaced points. It is unnecessary 
to enumerate the further theorems on convergence and degree 
of convergence which would be obtained by following out 
the transformation of variable in detail. 



CHAPTER V 


INTRODUCTION TO THE GEOMETRY OF FUNCTION SPACE 

I. The notions of distance and orthogonality 
In Chapter III, attention was directed to the problem of 
the approximate representation of a given function by means 
of linear combinations of other given functions, according to 
the criterion of least squares. If fix) is a given function 
over an interval (a, 6), and if p 2 (x), •••, pm(^) are 

711 linearly independent functions over the same interval, the 
coefficients ci , • • • , Cm in an expression 

(p ix) Cl Pi (x) + rj Ps tr) H h Cm Pm U) 

are to be detennined so that 

shall be a minimum. The value of the integral is taken as 
a measure of the discrepancy between the functions f(x) 
and y(ir). 

The problem is the same in principle as that of the niethod 
of least squares for the approximate solution of a set of linear 
equations. Suppose there are n equations in wi unknowns, 
?i > w : 

ffll + ^21 ‘^2 • • • “h y 

^12 •'''l "f" *^2 -f- • * • ’Xm — y 

ffln Xi + H + (lmnX,n ~ hn . 

The set of numbers (/>i, ^ 2 ? • • •• W niay be regarded as con- 
stituting a function 6(fr) bky in which the independent 
variable k takes on only a finite number of values, the 
integers from 1 to n. In the same way, each of the sets 
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{f(i\ j (ir2, • ‘ , Urn) n^ay be regarded as a function The ./'s 
then are the coefficients in a linear combination of these 
functions, 

wj, ~ a'{h) — ••• "h (/i), 

and are to be. determined so that 

i (/>;, 

k -\ 

shall be as small as possible. The sum of squares this time 
measures the discrepancy between the approximation and the 
function approximated. 

The form of the sum suggests a geometric iuteri>retation, 
in which th<‘ liinctions ir(h') are represented by points 

in u-dimensional s]>ace. with coordinates (^0 • and 

{fi'y, Ui, (Cn) respectively, and is the square 

of the distance lad ween these points. By an extension of 
the same idea, the functions /(j*), 7- (r) of the first paragraph 
are thought of as corresponding to points in a space of 

infinitely many dimensions, with J [yXO — </^(r)]“ as the 
square of the distance between them. This definition of 
distance is the beginning of a systcnnatic geometry of func- 
tion space. 

Another hint of geometric analogy which is present from 
the beginning consists in tlie recurring ust* of the term orthnf/- 
rnwh two functions being called orthogonal to each other 
over an interval wlimi the intc*gral of their product over the 
interval is zero. \Vliil(3 it is worthy of some emphasis that 
orthogonality of the functions which form the basis of the 
a])proximation is a convenience rather than a necessity, the 
notion of orthogonality is inseparably associated with the. 
least-square condition in another way. 

Let the functions /(./), (a?), 7;^ (.ij be. supposed con- 

tinuous, for simplicity of illustration. (It would be sufficient 
that they be integrable together with thenr squares, provided 
it is assumed that the p’s are properftj independent, in the 
sense that every linear combination of them containing a non- 
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vanishing coefficient is different from zero over a set ofpositiie 
meastire-, this condition is of course satisfied automatically if 
they are linearly independent and continuous.) /n order that 
<1 > • • • > Cot be the voefjkienls giving the least-square appreadma- 
tion, it is necessary and mfjiemit that /(;r) — y(x) he orthog- 
onal to each of the fnnctimis p, (x), or in other words it is 
necessary and sufficient that fix) — <f (x) he orthogonal to 
every linear combination of pi ix), • ■ p^ix). This can be 
verified algebraically, without a detailed examination of the 
necessary and sufficient conditions for a minimum in the 
calculus. 

Let »/'(c) be an arbitrary linear combination of the p's 
with at least one non-vanishing coefficient. Then 

w(.r) y(.r)-l-/< ) 


is a linear combination of the p's, for any value of h, and 
every linear combination different from g> can be written in 
this form by a suitable choice of if>. Let 


1 =-£[fi^)-vOWdx, 

X b f*b 

[fix) — [[fix) — c/' (rr)] — h tlx, 

li r [fix)— <f'ix)]iff(x)(Ix, -S' == f [ll'{x)y(lx'^i), 

e/rt f/rt 

Then J - ■ I — h{21i — hS). If 4^ 0, /j can be chosen so 
as to make J< I (for example by taking h li/S), contrary 
to the supposition that is the minimizing function, while if 
R = 0, J - - /-b i for every /- { 0. 

The conditions J* [/(x) — </■ (x)] i>,(x) rf x = 0, / 1 , 2, • • • , m, 

are equivalent to the m linear equations 


.Zo r Vji/)lhix) dx f{x)p,i.r 

j'=\ vff 


for determining the m <’s. It has been shown that these 
equations must be satisfied if the least-square problem is to 
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be solved, and that the problem will be solved if the equations 
are satisfied. For the moment there is still a question whether 
the equations have a solution. The answer is immediate, 
however; the left-hand members are independent of the func- 
tion f{x)y the least-square problem obviously has the unique 
solution Cl (*2 ('m = 0 in the particular case 

/{a') ^ 0, and consequently the determinant of the coefficients 
must be different from zero. If the />’s were not lineaidy 
independent, the problem of approximation could still be solved 
in terms of a linearly independent subset of them, and this would 
be at the same time a solution in terms of the original set 
of jy’s, but the solution in terms of the latter set as a whole 
would not be unique. 

Similar reasoning is applicable to the problem of the second 
paragraph of the chapter. Integrals are to be replaced by 
sums throughout, and in particular the property of orthogonality 
of two sets of numbers is expressed by the vanishing of the 
sum of the products of corresponding numbers of the two 
sets. The hypothesis of linear independence of the corre- 
sponds to the condition that the matrix of the coefficients a,fc 
be of rank m. The condition of orthogonality characterizing 
the least-square solution takes the form that the set of 
residuals hk — u:k is orthoqonal to each of the m sets of yimnhers Oik' 

n 

^ [hk — U'k) (Uk - 0. 
k=^ 1 

The it\i equation of this set may be constructed by multi- 
plying each of the given equations by the corresponding co- 
efficient of xi, and adding the equations thus obtained. Written 
out at length, the new equations have the form 

I y Uik (lucj Xi -j- I ^ Uik a2k^ *^*2 4“ • • ■ “4" X<r)i 

n 

Uik hky I ^ 2, • • •, 7a. 

k 

They are the icell-known normal equatiom" for the solution 
of the problem of least-square adjustment. 
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111 certain cases the necessary condition of orthogonality 
can readily be translated into another familiar form. Consider 
once more the case of functions of a continuous variable r. 
Let f{x) and q{x) be given functions for a ^ x < h, for 
simplicity continuous, and le^ q{x) be non-negative and not 
identically zero over the interval, and let Pn(x) be the poly- 
nomial of the nth degree which minimizes the inteoTal 

r%C'^) [/(.') -AW]* d.f. 

Then the remainder f {x) — Pnia), if 'not identically zero ichere- 
erer q must change sign at least n + 1 times in the inter- 
val {a, h). If is denoted by q(x), the integral to be 

minimized is the same as /[<z('^)/W — qix)Pn{x)Y dx, and 
the problem is that of approximating q(x)f(x) by a linear 
combination of the functions qix), xq (x)^ • • • , x^^ q{x) , By 
the general proposition obtained above, it is necessary that 
q{x)f{x) — q{x)Fn{x) be orthogonal to every linear com- 
bination of the functions x^qix), or, in other words, orthog- 
onal to qix)Qn{x). if Qn(x) is an arbitrary polynomial of 
the nth degree; in symbols, since q(^)i 

i Q C^') [/ W — Pn (.T)] Qnix) dx 0 . 

If /(;r) Fn{x) had not more than n changes of sign, it 
would be possible to construct a Qn (x) having the same 
sign as /(x) — Pn (x) at all points where /(x) — P,i (x) + 0, 
and this would make the integi'al positive, unless q (x) ~ 0 
wherever /(x) — Pn(.r) + 0. The condition that Ph (:r) be 
such as to give the remainder the requisite number of changes 
of sign is of course not sufficient for the solution of the 
least-square problem, since it is satisfied by any polynomial 
differing sufficiently little from the minimizing polynomial. 
A corresponding formulation is possible in the case of ap- 
proximation by trigonometric sums, with or without a weight 
function. 

The least-square problem, in general as well as in the 
special cases discussed in the preceding paragraph, is that 
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of choosing from a given linear famil}’ of functions the partic- 
ular one which is closest to another given function, according 
to the measure of discrepancy specified. The geometric inter- 
pretation that has been suggested gains in cleaniess if each 
function, instead of being represented merely by a point in 
a space of an appropriate number of dimensions, is also 
represented alternatively by the vector from the origin to 
the point in question. The linear family of functions then 
corresponds to a linear spread in the geometric picture, and 
the function of closest approximation coiresponds to the point 
of this spread whose distance from the point representing 
the function to be approximated is as small as possible; the 
residual function f{x) — y (j-) (or h(I,j — «•(/.)) can be regarded 
as the vector from 9 (.r) to f(x) (or from ir (/.) to h (A-)); and 
the property of “orthogonality” characterizing it is associated 
with the fact that the shortest distance from a plane spread 
to a point outside it is perpendicular to the spread. The geo- 
metric terminology thus acquires additional plausibility. 

2. The general notion of angle; 
geometric interpretation of coefficients of correlation 

An obvious further step is to proceed from the notion of 
orthogonality to a general notion of angle in function space. 
As will be seen presently, the definition of angle is already 
implicit in that of distance, if the frame of Euclidean geo- 
metry is to be fitted consistently to functional relations, 
though the question as to the possibility of carrying the idea 
through systematically and witlmut danger of internal contra- 
diction still calls for some elucidation. 

It will be well first to adopt a common notation for dealing 
simultaneously with functions of a continuous variable and 
functions of a discrete subscript. lfr(<), 1/ (f) arc continuous 
functions of t for a t< b, let 



If a;,, a;*, • • • , Xn, are two functions of an index A 

which ranges from 1 to «, let 
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n 

(x-ij) '^kyk- 

k - 1 

The quantity (x . y) is the inner or scalar product of the two 
functional vectors. In particular, {x-x) — ^Xk or 
etc. The condition for orthogonality of x and y is that 
{x- y) =0. A third ease worthy of explicit mention is that 
of infinite sequences ai, xa, • • •; j/i, ya, • ■ such that 
and convergent. The interpretation then is that 

00 

{x.y) —^xkijK- 
fc 

All three are of course isolated particular cases from the 
point of view of general analysis. 

Let x{(), i/(J), or j (A:), y{k), be two functions represented 
by the points P, Q respectively, and let 0 be the origin, 
corresponding to a function which vanishes identically. By 
the convention already adoi»ted, the sides of the triangle OPQ 
are 

OP^-=r. U.x), OQ‘-^-(y.y), 

fV ^ ((.'/ — ■<■)• (y — *)) = (y-i/) — 2(x-y) + ix-x). 


If the angle POQ is denoted by fl, application of the law 
of cosines to the triangle gives 

- - PP^-i- OQ^ — 2 OP • 0(^ cos fl . 

whence 207*-0(^cos« 20 -y), and 


cos fl 


_ 

(x . xyi'- iy • ■ 


When the independent variable is A -— 1 . 2 . the 

formula becomes 

Xk yk 


cos fl - 




This is recognized immediately as a fundamental formula in 
the theory of statistics, /f* is thv coefjicimi of correfatiun 
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between the variables x and y, if these have been reduced 
to the form of deviations from their respective arithmetic 
means, or, in other words, if = ^yk — 0. Subsequent 
pages will show that the geometry of function space throws 
much light on the stnicture of more complicated formulas of 
correlation. 

It becomes important to inquire more closely as to the 
logical basis for the identification of analysis and geometry. 
The essentials of such a basis are implicit in the considerations 
leading up to the proof of Theorem III in Chapter 111. To 
repeat in the present notation what is needed for the purpose 
in hand, restricting attention at first to a two-dimensional 
spread, let x, y be any two functions which are linearly in- 
dependent over one of the ranges specified in the third para- 
gi'aph preceding (or over some other appropriate range); if 
the range is an interval, let it be supposed for convenience 
that the functions are continuous. Let 


X, 




y— ?i 


U.-j^ 


It follows immediately from this definition that (?i*>/i) — 0. 
By the hypothesis of linear independence, furthermore, it is 
certain that iji is not identically zero. If 






the functions 'i, y are orthogonal to each other: - 0; 

and they also satisfy the condition that (?•?) = (vfl) — 1- 
They are linear combinations of x and y, and it is seen at 
once that the determinant of the coefficients is different from 
zero, so that x and y conversely can be expressed as linear 
combinations of $ and y. 

The functions x and y being given, let § and y now be 
any pair of functions such that x and y are linearly expressible 
in terms of ^ and y, and such that 


(? . y) = 0, (? • I) (t/ • 7) 1 . 
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It has been shown that such functions S, ^ can be constructed. 
There are infinitely many pairs .satisfying the requirements; 
the original can be replaced by + 
ii fii, "i, fit are any constant.s such that 






Furthermore, the requirements can be satisfied even if x and y 
are not linearly independent; it suffices then to express y 
in terms of two linearly independent functions xx,yx, and to 
construct functions ?, as above with Xx and yx in place 
of X and y. 

In terms of a chosen pair of normalized orthogonal 
functions ?, //, let 

Then 

(x • X) ^ (/'i + hj, {y*y) = (tl + fco, 

{{y — x)-(y--x)) = — 

If the functions x,y are thought of us corresponding to the 
points Py Q, with the coordinates (di, hx) and {a^, h^) respectively 
in a rectangular coordinate systeyn, while 0 is the originy the 
quantities {x>x)y {y*y). and {{y — x)-{y — x)) are the squares 
of the distances OP, ()Q, ayid PQ. The cosine of the 
angle POQ is 

aiUi + lahi 

More geyierally, if 

u kxX-\- pxy c - ltX-{- Pty - + 

are any linear ronihinations of x and y, and if My S are 
the corresponding points (.4,, i^i), (.4a, W*), thm {u*n)y (r-v)y 
and ((r — a) • (i — n)) are the squares of the distances OM, OSy 
and MS, and the quantity {n-v) is tlte scalar product of the 
rectors OM, OS. Every linear combination ^x-{-fiy corre- 
sponds to a definite point with coordinates (a ai a ^ , ^ h ) ; 
if X and y are linearly independent, a one-to-one correspondence 
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is established between 1‘iinctions linearly dependent on them 
and the points of a plane, in such a way that there is an 
actual identity between the measures of distance and angle 
and the (piantities which were associated with them originally 
by analogy. So far. moreover, theie is ito need of any space 
of more than two dimensions. 

The correspondence being once established, the analytical 
relations expressing geometric facts are implied with logical 
conclusiveness by the geometric, facts themselves. Consider 
for example the i)roblem of determining a constant A to mini- 
mize {{y — — A.r)), when j‘ and y are given. Jf j 

add y are interi>reted as statistical variables, re])resenting 
deviations from the respective arithmetic means, / is the 
coefticient of regression of y on ./*. The points corresponding* 
to x and y being P and as before, the function A./* is 
represented by a point M on the line ()I\ and y — A.r corre- 
sponds to the vector in the definite sense that when 
y — Aj- is expressed as a linear combination of ? and //. 
the coefficients of S and // are the components of the vector. 
It is clear from the geometric figure not only that MQwwx^i 
be perpendicular to ()I\ or in other words that y -lx must 
be orthogonal to x, but also that the value of A which 
accomplishes tin' purpose is given by 


A = 


i.op ^ ()M f)Qi\o:so, 
(.r.. (,T.r)'M'/-y)''- 


• ?/) 

(r • x) ’ 


where e is the angle P<)Q, and furthermore that the minimum 
value of ((// — — Aj;)) is 


MQ* -= (OQ nine f (y . */) (1 — cos* tf) (y-y)(l — r*). 


if cos 6, interpreted as a coefficient of correlation, is denoted 
by r. 

For the geometry of three (linearly independent) functions 
!/, z, let ?i, tlx, '4, >1 be defined as before, and let 




, _ i-. Piii'X) 




(Vi ’ Vi) ’ 
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Then satisfy the conditions 

(?.//) = = --- 1 . 

and j',y^ z are linearly expressible in tcmis of them. There 
are infinitely many sets of functions satisfying these same 
conditions, and serving equally well for the representation 
of x, y,r:, any such set being expressible in terms of the 
l)articular set f}, 'i by means of the coefficients of an orthog- 
onal linear* transformation i]i three variables. Any linear 
combination n = /i jy + vz can be expressed in the form 
Bfi + , and can thus be put in correspondence with 

a point (A, B,C), the square of whoso distance from the 
origin is the quantity (u • /r) — A^~\- B“-\~ If n and i 
are two such linear combinations, corresponding to the points B 
and Q, the cosine of the angle POfi is {n • • u) (?; • r)y -. 

The representation of functions by points or vectors is 
particularly (*onvenient for the visualization of coefficients 
of partial and double correlation. Let ./■, //. z be three given 
functions, corn'sponding to points P. Q, li in three-dimensional 
space. If a", //, z are sets of deviations from arithmetic means, 
so that the statistical terminology is appropriate, the co- 
efficient of partial correlation between ./• and y, when z is 
held fast, is the coefficient of correlation between .r — ?^z and 
y — where A and y are the regression coefficients of ./• 
on z and of y on z respectively. The function j — I z is 
a linear combination of x and orthogonal to its geo- 
metric counterpart is a vector in the plane POii, perpendicular 
to OR. Similarly, y — yz is represented by a vector per- 
pendicular to OR. and lying in the plane QOR . The angh' 
between these vectors measures the dihedral angle formed 
by the two planes. So the coefficient of partial correlation 
in question is the cosine of the dihedral angle. Let Pi , Qx , 
Pi be the points in which the rays OP, OQ, OP pierce the 
unit sphere about the origin as center. In the spherical triangle 
Pi Qi Rit let oc^ be the measures of the angles Pi, Qi, 
Ri respectively, and let a, b, c be the sides opposite these 
angles. Let ri2, /'is, rss respectively be the ordinary cor- 
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relation coefficients of x and y, x and 2, and y and 2^; let 
ri2.8 be the partial correlation coefficient which has just been 
discussed, and let 7*18/2 and 7*28.1 be the other coefficients of 
partial correlation. It has been seen that ri2.8 — cos y . In 
the same way, 7*18.2 = cos /?, r^sA cos a , while 7 ,2 cos r, 
7 i 8 = cos b, 7*28 ^ cos a. By the law of cosines, 

cos c — cos a cos b 

cos y — ^ : — r > 

sin a sm h 

which means that 

7*12 ^13 r ^s 

This is the standard formula expressing a coefficient of partial 
correlation in te'trns of ordinary correlation coefficients. The co- 
efficients 7*18.2, 7 * 28.1 of course have corresponding expressions. 
The inverse formulas 

7 'l 2 . 8 "i" 7 * 13.2 7 * 23.1 

etc., are similarly obtained from the polar triangle. 

For the definition of one of the coefficients of double cor- 
relation, parameters A, n are to be determined so as to mini- 
mize the quantity {{x — Xy — fiz) • {x — ly — nz)), or in 
other words to give the least-square approximation of x by 
means of a linear combination of y and This requires that 
x — Xy — yz be orthogonal to both y and geometrically, 
the point M representing the combination Xy-\-fiz is the foot 
of the perpendicular from P on the plane QOR. Then 
the coefficient of double correlation between x and the pair 
of variables y and z, is the coefficient of simple correlation 
between x and Xy-\-iiz,\hQ cosine of the angle i/OP between 
OP and the plane QOR. Let this angle be denoted by h. It 
is measured by the arc P, if 6 ’, is the foot of the altitude 
from Pi to the side Ri in the spherical triangle. Hence h 
may be calculated as a side of the right triangle R\ S^ P\, 
by the formula 


sin k = sin b sin r- 
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By substitution from the relations 


sin*i» = 1 — rjg, 


sinV == 1 — »'w .8 
it is found that 

/'1.28 = cos h = {1 — 


1 y - >-2 

^ 'l2 'iB 


^8 “ I ” ^ ^ 12 ^18 ^2 


( 1 _, 2 )( 1 _^ 2 ) 


sin* hy^ 


(^2 + il 


18 






There are corresponding formulas for the other double cor- 
relation cofficients r2.i8 and rs.i^. 

The same figure may be used to obtain formulas for the 
coefficients t and fiy the partial regression coefficients of 
X on y and z. Let lines be drawn through M parallel to 
OR and OQ, meeting OQ and OR oX. K and L respectively, 
to form a parallelogram OK ML. The vectors OK and OL, 
constituting a resolution of OM into components collinear 
with OQ and ORy represent separately the terms Xy and (az. 
On the surface of the sphere, let the arcs Ri Si and Si Qi be 
denoted by d and e, so that d measures the angle LOM, 
while e measures the angle MOK and its equal OML. (The 
formulation is adapted throughout to the case in which the 
point M falls within the angle QORy so that Si is interior 
to the arc QiRi.) In the plane triangle LOMy having two 
of its angles equal to d and e respectively, the third angle, 
at Ly is the supplement of d~{-e. But the arcs d and e on 
the sphere make up the side a of the original spherical triangle. 
So the law of sines in the plane triangle gives 


OL sin e sin e 

OM sin {d -f e) sin a ' 


In the right spherical triangle Pi Si Qi , on the other hand, 

sin c cos 

sin e — 

so that 


OL 


cos h ^ 
OMsinc cosfi 


sin a cos h ' 


11 
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while ()L = ft-OB, and OM=OPcosh. Hence 


OL 
== OR 


OP sin c 
OK^ina 


cos/?, 


or with the use of a conventional notation, presently to be 
explained at greater length, for regression coefficients and 
standard errors of estimate, as well as that already used for 
coefficients of correlation, 




Similarly, 


O '!. 2 
08.2 




^^ 12.8 


OP Sin b 

7777-; — cos r 

O Q sin a 


<0.8 

0 ' 2.8 




The results of the preceding paragraph can be obtained by 
a possibly less straightforward but more exclusively geometrical 
method, making no use of the spherical triangle or of spherical 
trigonometry as such. Let the letters O, P, Q, P, i¥. A', L 
have the same signification as before. The letters u, 6, r, 
((f r retain their previous meanings as measures of the 
face angles and dihedral angles of the trihedral angle 0 PQR, 
Let H be the foot of the perpendicular from M on OQ, and 
J the foot of the perpendicular from R on OQ, The triangles 
KMH and ORJ in the plane QOR are similar, since KM 
is parallel to OP (having been so constructed), is parallel 
to P/ (both being perpendicular to OQ), and KH and OJ 
are collinear. Furthermore, K M and OL are opposite sides 
of a parallelogram. So 

OL _ KM MH 
OR ~~ OR ’ RJ ' 


But the length of PJ, the perpendicular from P 011 OQ, is 
OPsina. Also, PH is perpendicular to OQ, since MP, being 
pei*pendicular to the plane QOR, is perpendicular to the 
line OQ, and OQ, being perpendicular to MP and MH, is 
perpendicular to their plane and to the line PH in that plane ; 
hence MHP is the measure of the dihedral angle so that 
MH—HP(^ 09 >fi (as PMH is a right angle), while HP 
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--- OFsinc, making MH = OPsinccos/^. The resulting 
expression for fi is the same as before. 

The preceding calculations are not restricted in substance 
to the case of statistical variables, but can be formulated 
without the terminology of correlation, and are then applicable 
to any functions x, y, z coming under the original hypotheses. 
It is to be emphasized also that for the time being no use 
has been made of space of more than three dimensions; the 
geometry is the actual geometry of experience. 

3. Coefficients of correlation 
in an arbitrary number of variables 

For dealing with relations of higher complexity it will be 
convenient to modify the notation somewhat. When there 
are m functions to be considered, let them be denoted by 
• • •, Xm. The case of primary interest will be that of 
statistical variables, measured from an arithmetic mean in each 
case; i. e., xi will stand for a set of numbers xn, Xt2, • • •. Xmr 
subject to the condition that a:,i+ ^,2+ • • • +Xin 0 . Apart 
from technical notation and terminology, however, the work 
will be valid for functions of any of the types previously 
considered. As an additional item of notation, (x • x) will 
be abbreviated to ((a;))*. 

The reduction of general coefficients of correlation and 
coefficients of regression to expressions in terms of coefficients 
of lower order depends on the following fundamental pro[)- 
osition : 

Let ^2. • • •. be determined so (is to minimize 
((a - ^2 .r. — . . . — 
and ivith these values of the A’n, Id, 

w .r, -- ■ — A„, 

Let /tg, . . . , and ps? • • •? Pm he determined so as to minimize 
((^1 — /^3 ^8 — • • • — ym Xm)Y and ((xg — r., x^ — • • . — 
and let 

(p Ti /tg .Tg • • • ymXfn , ift .7*g — J 'g ,i\^ — . • . — v .t , 


11 
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Ijei he deterinined so (is to minimize (fy — yf Then 

f») (/• — yi iff , 

The proof depends on the still more fundamental fact, imme- 
diately deducible from the definition of orthogonality, that 
if one fundioyi is orthogonal to each of two othei' fmctions, 
if is orthogonal to every linear comhinatiofi of them. 

It is at once apparent from the definitions of ^ and ify that 
f/> — / is a linear combination of the :r's, of the same form 
as o)^ the coefficient of Xi being unity in each case; the 
question at issue is that of the identity of the remaining 
coefficients. 

By a theorem discussed early in the chapter, a (necessary 
and) sufficient condition characterizing the coefficients in o) 
is that o be orthogonal to each of the functions - ^ 

By the same theorem, (/ is orthogonal to each of the 
functions j\\, •••.am, and ip is likewise orthogonal to each 
of these functions. Since is orthogonal to 9 / and to ip^ 
it is orthogonal to the combination f/ — y/ifj. The same is 
true of • • • , In other words, f/> — ./i/^ / 6 ‘ orthogonal 
to earh of the functions xs, • ♦ *, a’m* 

By one more application of the theorem, — ./ip is 
orthogonal to ip. But can be expressed in the form 

•'i ipfr:^X:\f h^m^m. Conscqueiitly, being orthogonal 

to ip. - ^ if — y/xp is orthogo7ial also to 

The identity of if — ./ip with is thus established. 

For m 3, the proposition is ecpiivalent to a relation of 
perpendiculars which is important in deriving the formulas 
of spherical trigonometry: if OP, OQ, OP are three rays 
issuing from 0 , if .Y is the foot of the perpendicular from 
on the line OP, if a line is drawn through in the plane QOH 
perpendicular to OP, and if is the foot of the perpendicular 
from F on this line, then if P is perpendicular to the plane QOli. 
This figure yields the equation sin/i sin& siny, which was 
used in obtaining the formula for a coefficient of double 
correlation. Essentially the same configuration appeared also 
in the discussion of partial regression coefficients. 
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For the statistical application with an arbitrary value of m 
a somewhat elaborate notation is appropriate. Let the func- 
tion w, as originally defined by the formula xi—hiX2^ 

according to the least-square criterion, be denoted by 
It ma}’^ be spoken of as the residual of Xx tviih respect to 
X2y ••• ,Xm* (The order of the subscripts 2 , 3 , • • •, m 
among themselves is clearly immaterial.) Similarly, 9 and tp, 
the residuals of Xi and of with respect to xiiy^--yXm7 
are to be denoted by xi.s m and X2.ii..^m- The standard 
deviation ffk of any one of the original variables Xk is defined 
by the equation = {{xj)y/n. (The presence of the 
denominator w is a mere matter of definition, as far as the 
present discussion is concerned, since the equations will in- 
volve only ratios of standard deviations; except for the 
statistical interpretation, the (positive) square root of ((x))* 
itself may be used in place of ak.) The correspondingly 
defined standard deviation of such a residual asxi.2a . m? the 
standard e^'ror of eMimate of x\ in tei'tns of X2, is 

denoted by m- The partial regression coefficients 

^2, hx •••, Im are represented by 612.84 . m, 618.24 . m, *•*, 
6im.28. - m-i. The first subscript of each of the 6’s is that 
of the variable approximated, the second is that of the 
particular variable to which the coefficient in question is 
attached in the regression formula, and the other subscripts, 
the order of which among themselves is without significance, 
are those of the remaining variables. In the same way, 

are to be replaced by ^ 18 . 4 - ■ - m j * * * ? ^lfn.8- • -m— 1 j 

and 

Pm by 623.4 . •my ' • 'y 627 n..S.. -m— 1 • The coefficient 
of correlation between y and <//, or, in the present notation, 
between Xi.s- .m and X2.3.. my is the coefficient of partial 
correlation between xi and xo when Js, • • Xm, are held fast, 
and is represented by ri2.3. .m. 

In the new notation, the general proposition about the 
identity of the functions previously called w and tp — 
asserts that xi.28. m is the same as the residual of xi.s . m 
with respect to X2,8.. .m. It is important to bear in mind 
however that its essential content is independent of the 
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number of variables and their individual designations, and 
this essential content is more adequately though less concisely 
expressed by saying: 

The residual of a f/iven fitmtion with respect to a set offunc- 
timis mat/ he obtained by calndating mtressively the. residuals 
of the given fnnction and of any chosen function of the set 
with respect to the remaining functions of the set, and then 
taking the residual of the former of these residuals with respect 
to the latter. 

An ealier paragraph contained a derivation of the value 
of the simple regression coefficient minimizing the expression 
{iy — -(y — ^x)). The formula of that paragraph which 

reads 

j _ iy-yY- (^-y) 

“ (x.-vy^Hy-gy-^ 

becomes in the present notation ^ (pjai) /•,*. Interchange 
of the variables gives bn - (^1/^4) ri.; the simple coefficient 
of correlation is symmetrical in its two variables, so that 
til = fit, while bti and bn are different. Incidentally it 
appears that /i* = biift-. 

Corresponding formula.s for partial regression coefficients 
can be obtained immediately. It is apparent on inspection 
that while the coefficients of x*, •••, Xm in the function 
Xi.ti-. in = ">, as expressed in the fora y — ar® con*’ 
binations of the /t’s and the r’s, tlm coefficiml kt — bu.9 .» is 
merely .ri. But this ./ is the simple coefficient of regression 
of y with respect to and as such is expressible in terms 
of the standard deviations of y and tp and the correlation 
between them. The standard deviations of y and «/< are 
m and «!.» -m respectively, and their coefficient of cor- 
relation is >12;, ..,u. Consequently 

, ^1.9- .-m 

012 9. m — 

®2.8. • m 

By interchange of subscripts — in other words, by consideration 
of the coefficient of regression of X2 9...m with respect to 
it is found that 



V. GEOMETRY OF FUNCTION SPACE 167 

' F12.8. -m? 

01.8- -m 

the definition of the coirelation coefficient is symmetrical 
with respect to the first two subscripts. Combination of the 
equations for the 6’s gives 

ri2 8--.m ' (&12 3** m ^^21.8* • • 

In the earlier paragraph to which reference was made above 
the minimum of ((// — — kx)) was evaluated in the 

form (y-y)(l — r*). In the new notation this result is ex- 
pressed by the equation cr| ^ <r|(l — r^J. The corresponding 

equation with subscripts reversed is o o^i(l — Applied 
to the standard deviation of w — 9* — regarded as the 
residual of </> with respect to </', it becomes 

28.. m 8 -m 

Here again it is important to recognize the essential content 
of the formula, as distinguished from the notation in which 
it is expressed. Written down successively for cases of in- 
creasing complexity, with a particular choice as to the dis- 
l)osition of subscripts each time, it yields 

®I.2= — 

~ S2 “■ ‘^12^^ His. 2 ^' 

*’^1.284 “” *^1.428 '' ®^r.28(^ 'l4.2:l^. 


*^12S . m ■ ®r »»28.-.»i-l 2H -wi — -.m 

and by combination of all these equations 

O'f/iS. m—~ O'! (1 /■i2)(l '/'f# 2)(l j'l4 29 ) ' • • (l — >'lm.2S.- «i— l) • 

Reverting to an abbreviated notation for certain residuals, 
but with a modification of that previously employed, let 

— X\.u -m* — ■ X2.H4- ?n * 


yi — a*! 4. -7/1* 


^2 •r’2 4* •?n* 


9 'a ■ — = Xs.4 ••m* 
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By the fundamental theorem about residuals, Wi maj- be re- 
garded as the residual of yi with respect to y*, and w* is 
tlie residual of ya with respect to ys- Hence Ow voefficient 
of correlation betivcen oh and Wg is the coefficient oj partial 
correlation heitceen and trhen is held fast, and is 
expressed in terms of the simple correlations between <jpi , ya 
and ya by the formula previously obtained in the discussion 
of correlations among three variables. If the con*elation 
between and (o^ is denoted for the moment simply by r\ 
and if the correlations between yi and y-. yi and ya? and y^ 
and v’3 called respectively r'12, na and ria, then 

/ • / / 

, _ i'l2 — 

But from the point of view of the dependence of the,se 
quantities on Ji. x>, Xm, 

ri2 4 . WH ru ri3 m* ^*28 >‘2H m* 

ri 2 . 4 ^-- m Ha . I w^‘ 28.4 • m 

By the last equation any partial correlaticii coeflicient can 
be calculated in terms of correlation coefficients of lower 
order, that is to say, correlation coefficients' involving a smaller 
number of variables, and so ultimately in terms of ordinary 
correlation coefficients. 

Let the partial regression coefficients of Xi with respect 
to xa, •••, Xm be denoted once more by h, 
and let ® — ^2 + •'r;i+ ••• -\-^mXm- The coefficient 

of correlation between x, and ® is the iwfjkimt of multiple 
(orrelatfon between *1 and the net of rnriahles x->, •••• Xm. 
It may be represented by Cl. M m> If the regression coefficient 
of iCi on © is L, the square of the standard deviation of 
Xi — L® is ff*(l — But any constant multiple of © 
i.^ a linear combination of and by the definition 


/•' “ •/•l2 »4 -m. >’12 — 

and consequently 

^*12.81 • m = 
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of <D no other linear combination of x>i, •••, can give so 
good an approximation to Xi according to the least-square 
criterion as ® itself. Hmce it mmt be that L 1, and the 
standard deviation of a:, — LQ> is the standard deviation of 
— (P, already denoted by ai a,. so that 

Taken in conjunction with an equation previously obtained 
for this shows that 


1 '•ra.-.n- d 4)(J 'Ymaa.. 


ii relation fi'om which tlio coefficient of multiple correlation 
can be calculated. 


4. The geometry of frequency functions 

Apart from the generality which the preceding account of 
the application of geometry to analysis possesses by virtue 
of the fact that at the outset the independent variable may 
be one taking on a finite number, an enumerable infinity, or 
a continuous infinity of values, its substance can be given 
still another setting. In the statistical case, if there are for 
example just two functions concerned, av, -- jrik) and t/h~ !/(kl 
the formulas involve the different values of k symmetrically, 
and are not affected if the n pairs of numbers , //a) are 
permuted, each pair by itself being kept inviolate. That is 
to say, the independent variable serves only to define the 
association of a value of a- with a value of y, and has no 
further significance of its own% For dealing with large 
numbers of observations, the problem may be idealized by 
supposing that (within limits, perhaps) any value of .r may 
be associated with any value of y, but that some pairs of 
values (a-, y) occur oftener than others, to an extent indicated 
by a frequency function H^U',y), whose integral over any 
region of the .r, y plane measures the frequency of pairs of 
observations (j*, y) falling within that region. If and y 
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are measured as deviations from their respective means^ this 
fact is indicated by the conditions 



x^(xy y) dxdy 


^ 0 , 


Xf 


y<p{x, y) 


dxdy 0, 


the integrals being extended over the range of definition of fp . 
The squares of the standard deviations of x and y are then 


y)dxdy y) dxdy 

^ ^ 9 {:x,y)dx dy ^ ^ ^{x,y)dxdy 

and the coefficient of coiTelation between them is 


J* J xy(f{x, y) dxdy 


[.fj* x^<p{x, if <f{x, y) dx dy 


\\n 


To illustrate the geometry of frequency functions, let the 
case of three dimensions be chosen. Let (pix^y^z) be a non- 
negative continuous function of its three arguments, to be 
regarded as a frequency function for the occurrence of the 
set of values {x^y^z) for three measured variables. To ob- 
viate the necessity of convergence proofs, let it be supposed 
that (p is different from zero only over a finite domain of 
three-dimensional space. It may further be supposed without 
essential loss of generality that the triple integral of (p over 
the domain where it does not vanish is equal to unity— in 

abbreviated symbolism, j (f> = 1 — which means that all 
frequencies are referred to the total number of cases as 
unit. The assumption that a;, y, z are measured from their 
arithmetic means is equivalent to the set of equations 

Jx(p ^ Jyfp - Jz(f> 0; no use will be made of these 
equations, except to justify the statistical terminology em- 
ployed. The squares of the standard deviations of x, y, ^ 

are f X since the quantity J*y, which 
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would othenvis(! appear as denominator, is 1 . The coefficient 
of correlation between x and y is ^xy y/[ ( J ./•* If] ( J y* y>] |''' . 
and the other coefficients of correlation are correspondingly 
defined. 

As a first step toward the setting np of a geometrical 
representation, let auxiliary notation he introduced as follows: 



In these formulas A', 1'. and Z, as well as y. are to he 
thought of as functions of x, y, and z, and the integiatiou 
as performed with regard to these variables. As the func- 
tional determinant of A'. 1”, Z with respect to .>■. y. ■ is 1. 
however, the integrals may ccpially well be taken with regard 
to A', r, Z, if the limits of integration are suitably adjusted, 
or. what comes to the same thing (since each integrand is 
identically zero except over a finite region), if the integralh 
are extended over the whole of space. As a function of 
A’, 1', Z, let y(.r. y, be denoted by 9', (A’. Y. Z). Then, 
with A', r, Z as variables of integration, 

i 

Let 

A7«/i, 7i I’.T,. i', -- Zi(i>i. 

and let the quantity y, z) - Oi r, o), (A, V. Z), as 

a function of the new variables, be represented by (Pi (?i , ?i, Ci i. 
In terms of S, , , ti as variables of integration, the relations 

J's,.,,®, J 




Al v, J XZvi J YZif, - 0. 


U 
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are satisfied, together with the equations 

J / //?®t — J C? a>i = 1, 

while J*J*J*v(‘*> y* z) dx dy dz — 1 goes over into 

JTJ ®‘ 1 • 


If '»! S'l + <»i «?i + Cl and «» ?i + hi »?i + c, are any two 
linear combinations of 

J* 5i+ hi + Cl fei)(a* + h* »/, + qi ti) ®i = «t a*+ h, h* -|- CiCi. 

Finally, for the sake of generality, let be any 

variables expressible in terms of ?i , »yi , Ci by means of a 
(normalized) orthogonal transformation. Let ®i (?i , ^i , fi) 
— . a)(|, t). As the determinant of the transformation is 

± 1 , the equations of the preceding paragraph, including the 
last one as formulated with the various sets of coefficients 
involved in the transformation, give 



J (D =J|*© = J»/a) -Jc*© 1. 

the variables of integration now being C. The basis 
of the geometrical interpretation is the possibility of finding 
11 , C as linearly independent linear combinations of x, y, z, 
so that these relations are satisfied. 

The equations expressing in terms of x, y,z mani- 

festly are in fact linearly independent, and x, y, z consequently 
can be linearly expressed in terms of ?, fl, C. Furthermore, 
apy linear combination of x, y, z can be similarly expressed. 
Let 

U — «i a; -f- j/ -f- 2 “ .-li # -1- Bi tj -f t\ i 


be any such combination. Then 

dxdydz --- / ^ + 0 * (Dd^di) dC 

— ill 4* 4* C? . 
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If 

UaX + fity-i-ytZ = i4* ? + i? + C* C 


is any second combination of the same form, 


m 


VVff dor dy dz 




Let P be the point with coordinates (ili, Bi, 6i), and Q the 
point (Ai, B 2 f Ct)y with reference to a system of rectangvlar 
arcs in three dimensions^ and let 0 be the origin. Then 

J U*ff‘ and the variables of integration being x, y, z, are 

the squares of the distances OP and OQ, and 

J'c- !>/[(/[;■,,) (Jk-,)]'", 

the coefficient of cm relatmi betweeti U und V, is the cosine 
of the angle POQ. Through the medium of the equations 
expressing x, y, z in terms a/ each of the variables 

X, y, 2, and every f inear combination of themy can he associated 
with a definite point in three-dimensional space, in such a way 
that standard deviation and coefficient of correlation have the 
same sort of geometrical meayiing as before. From this begin- 
ning tho geometrical structure of the theory of correlation 
can be built up along the lines previously followed. 


5 . Vector analysis in function space 
There is further scope for the application of simple geometric 
ideas in functional analysis, where the complete picture calls 
for a geometry of infinitely many dimensions. For a single 
illustration (discussed by Leyy, Le^cms d' analyse fonctionnelle, 
Paris, 1922, pp. 127-128), consider the integral 

O .. Sliy) -=£V(x-,y,y)rf-/-. 

where y is a (suitably) arbitrary function of jc,y — /(x). 
and F is a given function of its arguments. Not to enter 
into details with regard to questions of continuity, let it be 
a.ssumed that all the functions that appear in the discussion 
have as many continuous derivatives as are needed to justify 
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the operations performed. If y is looked upon as a point in 
a function space of infinitel;^ many dimensions, or as a vector 
from the origin to the point, S2{y) is a scalar point function 
in that space. Subject to the appropriate conditions of diffe- 
rentiability, let ij{x) be an arbitrary function vanishing at 
a and at b, and h an arbitrary constant. The familiar process 
of differentiation gives 

["iV ^ X 

^ X ^ 2/, y') ~ Fy' {x, y, y'^dx, 

the last expression resulting from an integration by parts 
performed on the second term in the brackets, with use 
of the fact that ij (a) ~ i]{b) = 0. Let the function 
Fy — {d/dx)Fy' be denoted by f(x). Then the expression 
for the derivative may be abbreviated to ^yyidx, or, in 
a notation previously employed, (y-y). The variation hti may 
he regarded as an infinitesimal vector increment of the vector y 
in function space, of geometric magnitude If the 

increment of Si is divided by this quantity, instead of li, 
passage to the limit gives a directional derivative in the 
direction of the vector^. Its value is {y ■ y>)l{y • But 

this can be written in the form 

u . 

from which it appears that the direciional derivative in the 
direction ^ is equal to the quantity (y • multiplied hy the 
cosine of the angle between the vectors ^ and f , being greatest 
when is collinear with y. The ^Afunctional derivative^' f, 
considered as a vector in function space, thus has the character' 
^f ^ gradient, of the scalar point function ^ . 

The functional derivative can also be obtained formally 
after the analogy of the ordinary representation of a gradient 
in terms of a rectangular coordinate system. Let it be 
supposed now that y ~ f{x) itself vanishes at the ends of 
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the interval (a, h), and that it is expanded in a series of 
normalized orthogonal functions nuioc), each of which likewise 
vanishes at the ends of the interval, in the form 


fir) — ai », (x) + rts M, (3;)+ • • •. 


TiCt it be assumed further that this series admits differen- 
tiation term by term. The integral is a function of the 
infinitely many variables («,, «*, ...), the coordinates of the 
point y with respect to a set of rectangular axes in space 
of an enumerable infinity of dimensions. The derivative of ii 
with regard to au is 


i' 

u a 


daii 


!/• y) dr 


‘ X 'J' y'^ J'V y> y') X* (^'01 dr 

„ [Fui.x,y,y')— . fy'(r,y.y')]nk{r)dr 



ff{,u^Uk(r)dr. 


This is the Fourier coefficient of y (o’) with respect to itk{r). 
The expression 




dSl 

dak 


«fc (at) 


for a rector huviny the component dill dak in the direction 
of the corresponding coordinate axis is the formal expansion 
of the functional derivative y according to the orthogonal 
system (u^). 

An exposition of other elementary develo])ments of the 
vector analysis of function space, rendered concrete by the 
use of theorems on the convergence and degree of convergence 
of certain expansions in series of orthogonal functions, has 
been given by the author elsewhere (Annals of Mathematics, (2), 
vol. 27 (1926), pp. 551-567; Bulletin of the American Mathe- 
matical Society, vol. .32 (1926), pp. 641 643). 
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